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SEED PIANTS CHARACTERIZED BY DEIAYED SEED DISPERSAL 

This invention was made with government support 
under DCB901874 9 awarded by the National Science 
Foundation. The government has certain rights in the 
5 invention. 



BACKGROUND OF THE INVENTION 

FIELD OF THE INVENTION 



The present invention relates generally to 
plant molecular biology and genetic engineering and more 
10 specifically to the production of genetically modified 

seed plants in which the natural process of dehiscence is 
delayed. 



BACKGROUND INFORMATION 



Rapeseed is one of the most important oilseed 
15 crops after soybeans and cottonseed, representing 10% of 
the world oilseed production in. 1990. Rapeseed 
contains 40% oil, which is pressed from the seed, leaving 
a high-protein seed meal of value for animal feed and 
nitrogen fertilizer. Rapeseed oil, also known as canola 
20 oil, is a valuable product, representing the fourth most 
commonly traded vegetable oil in the world. 

The production of oilseeds, meal and oil from 
rapeseed plants has been increasing continuously for the 
last 30 years for food and feed grains, mainly by 
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expansion of the area under cultivation. Most northern 
European countries produce rapeseed as their main edible 
oil crop. By the year 2000, China is expected to be the 
leading producer with 9.2 metric tons (Mt; 26%); followed 
5 by India with 7.8 Mt (22%); the European Community (12 
countries), with 7.6 Mt (21%); Canada, 3.8 Mt (11%) and 
eastern Europe with 2.6 Mt (7%). 



Unfortunately, the yield of seed from rapeseed 
and related plants is limited by pod dehiscence, which is 

10 a process that occurs late in fruit development whereby 
the pod is opened and the enclosed seeds released. 
Degradation and separation of cell walls along a discrete 
layer of cells dividing the two halves of the pod, termed 
the "dehiscence zone, " result in separation of the two 

15 halves of the pod and release of the contained seeds- 
Seed "shattering," whereby seeds are prematurely shed 
through dehiscence before the crop can be harvested, is a 
significant problem faced by commercial seed producers 
and represents a loss of income to the industry. Adverse 

20 weather conditions can exacerbate the process of 

dehiscence, resulting in greater than bOl loss of seed 
yield . 



Attempts to solve this problem over the past 20 
years have focused on the breeding of shatter-resistant 

25 varieties. However, these plant hybrids are frequently 
sterile and lose favorable characteristics that must be 
regained. by backcrossing, which is both time-consuming 
and laborious. Other strategies to alleviate pod 
shattering include the use of chemicals such as pod 

30 sealants or mechanical techniques such as swathing to 

reduce wind-stimulated shattering. To date, however, a 
simple method for producing genetically modified seed 
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plants that do not open and release their seeds 
prematurely has not been described. 

Thus, a need exists for identifying genes that 
regulate the dehiscence process and for developing 
5 genetically modified seed plant varieties in which the 
natural seed dispersal process is delayed. The present 
invention satisfies this need and provides related 
advantages as well. 

SUMMARY OF THE INVENTION 

The present invention provides a non-naturally 
occurring seed plant that is characterized by delayed 
seed dispersal due to ectopic expression of a nucleic 
acid molecule encoding an AGL8-like gene product. The 
AGL8-like gene product can have, for example, 
substantially the amino acid sequence of an AGL8 ortholog 
such as Arabidopsis AGL8 (SEQ ID N0:2). Particularly 
useful seed plants of the invention, which are 
characterized by delayed seed dispersal, include members 
of the Brassicaceae, such as rapeseed, and members of the 
Fabaceae, such as soybeans, peas, lentils and beans. 

In one embodiment^ the invention provides a 
transgenic seed plant that is characterized by delayed 
seed dispersal due to ectopic expression of a nucleic 
acid molecule encoding an AGL8-like gene product. In a 
25 transgenic seed plant of the invention, the nucleic acid 
molecule encoding the AGL8-like gene product can be 
operatively linked to an exogenous regulatory element. 
Useful exogenous regulatory elements include constitutive 
regulatory elements and dehiscence zone-selective 
30 regulatory elements. In particular, the exogenous 

regulatory element can be a dehiscence zone-selective 



10 



15 



20 
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regulatory element that is an AGLl regulatory element or 
an AGL5 regulatory element. 

In another embodiment, the invention provides a 
non-naturally occurring seed plant that is characterized 
5 by delayed seed dispersal due to suppression of both AGLl 
and AGL5 expression in the seed plant. Such a 
non-naturally occurring seed plant characterized by 
delayed seed dispersal can be, for example, an agii agl5 
double mutant. 

10 The present invention further provides a tissue 

derived from a non-naturally occurring seed plant of the 
invention. In one embodiment, the invention provides a 
tissue derived from a non-naturally occurring seed plant 
that has an ectopically expressed nucleic acid molecule 

15 encoding an AGL8-like gene product and is characterized ' 
by delayed seed dispersal. In another embodiment, the 
invention provides a tissue derived from a non-naturally 
occurring seed plant in which AGLl expression and AGL5 
expression each are suppressed, where the seed plant is 

20 characterized by delayed seed dispersal. 



25 



Methods of producing a non-naturally occurring 
seed plant characterized by delayed seed dispersal also 
are provided herein. Such methods entail ectopically 
expressing a nucleic acid molecule encoding an AGL8-like 
gene product in the seed plant, whereby seed dispersal is 
delayed due to ectopic expression of the nucleic acid 
molecule . 



30 



The invention also provides a substantially 
purified dehiscence zone-selective regulatory element, 
comprising a nucleotide sequence that confers selective 
expression upon an operatively linked nucleic acid 
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molecule in the valve margin or dehiscence zone of a seed 
plant, provided that the dehiscence zone-selective 
regulatory element does not have a nucleotide sequence ■ 
consisting of nucleotides 1889 to 2703 of SEQ ID NO: 4. 
5 The dehiscence zone-selective regulatory element can be, 
for example, an AGLl regulatory element or AGL5 
regulatory element . 



Further provided is a plant expression vector 
containing a dehiscence zone-selective regulatory element 

10 that confers selective expression upon an operatively 
linked nucleic acid molecule in the valve margin or 
dehiscence zone of a seed plant, provided that the 
dehiscence zone-selective regulatory element does not 
have a nucleotide sequence consisting of nucleotides 1889 

15 to 2703 of SEQ ID NO: 4. If desired, a plant expression 
vector can contain a nucleic acid molecule encoding an 
AGL8-like gene product in addition to the dehiscence 
zone-selective regulatory element. 



The invention also provides a kit for producing 
20 a transgenic seed plant characterized by delayed seed 
dispersal, such kit containing a dehiscence 
zone-selective regulatory element that confers selective 
expression upon an operatively linked nucleic acid 
molecule in the valve margin or dehiscence zone of a sfeed 
25 plant, provided that said dehiscence zone-selective 

regulatory element does not have a nucleotide sequence 
consisting of nucleotides 1889 to 2703 of SEQ ID NO: 4, 
In a kit of the invention, the dehiscence zone-selective 
regulatory element can be, if desired, operatively linked 
30 to a nucleic acid molecule encoding an AGLB-like gene 
product . 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a scanning electron micrograph 
of an Arabldopsis gynoecium at about the time of 
pollination. A number of distinct cell types are shown^ 
5 including the apical stigma, the style, and the ovary* 
The ovary walls, or valves, which are separated along 
their entire lengths by a small suture denoted the 
"replum, *' are indicated. The dehiscence zone, a narrow 
band of cells one to three cells wide along the 
10 valve/replum boundary, also is indicated. 

Figure 2 shows a wild type Arabidopsis fruit 
immediately following pod shattering. The seeds as well 
as the replum are clearly visible. 

Figure 3 shows scanning electron micrographs of 
15 wild type Arabidopsis and a representative 35S: :AGL8 

transgenic line. The dehiscence zone is evident in the 
wild type plant. In contrast, in the 35S::AGL8 
transgenic line, the cells of the outer replum are 
converted to a valve cell fate, and the dehiscence zone 
20 " is absent . 

Figure 4 shows the agl5 and agll genomic 
regions and the loss of AGL5 or AGLl expression, 
respectively, in the agl5 or agJl mutant. Figure 4A 
shows the genomic structure of the AGL5 gene, with the 

25 positions of exons indicated by boxes, and the positions 
of introns indicated by thin lines. The agl5 mutant 
allele, generated by targeted disruption. following 
homologous recombination, has a kanamycin resistance 
cassette that is indicated by a yellow hatched box and 

30 located within the MADS-box region. Figure 4B shows the 
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genomic structure of the AGLl gene, with the position of 
the approximately 17 kb T-DNA insertion into the large 
intron of the agll~l locus indicated by the arrowhead. 
Exons are indicated by boxes. Introns are indicated by 
5 thin lines. The MADS-box region is shown as a hatched 
box. Figure 4C shows that a probe specific for the 3* 
end of the AGL5 complementary cDNA detected the AGL5 
transcript in wild type but not in the agl5 knockout 
mutant plants. Figure 4D shows that a probe specific for 
10 the 3' end of the AGLl complementary DNA (cDNA) detected 
the AGLl transcript in wild type but not in the agll 
mutant generated by T-DNA insertion. 

Figure 5 shows scanning electron micrographs of 
wild type Arabidopsis and an agll agl5 double mutant. 
The valves are beginning to detach from the replum in the 
wild type Arabidops is fruits, which are shown during the 
process of dehiscence. At the same time in development, 
the valves of the agll agl5 double mutant plant remain 
attached to the replum. 

Figure 6 sho\^s the nucleotide (SEQ ID ^30:l) and 
amino acid (SEQ ID NO: 2) sequence of Arabidopsis AGL8 . 

Figure 7 shows the nucleotide sequence of the 
Arabidopsis AGLl gene (SEQ ID NO: 3). The exons and 
25 translation start site are indicated. 

Figure 8 shows the nucleotide sequence of the 
Arabidopsis AGL5 gene (SEQ ID N0:4). The exons and 
translation start site are indicated. 



15 



20 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a non-naturally 
occurring seed plant that is characterized by delayed 
seed dispersal due to ectopic expression of a nucleic 
5 acid molecule encoding an AGL8-like gene product. The 
AGL8-like gene product can have, for example, 
substantially the amino acid sequence of an AGL8 ortholog 
such a.s Arabidopsls AGL8 (SEQ ID NO:2). 

The fruit, a complex structure unique to 
10 flowering plants, mediates the maturation and dispersal 
of seeds. In most flowering plants, the fruit consists 
of the pericarp, which is derived from the ovary wall, 
and the seeds, which develop from fertilized ovules. 
Arabidopsls ^ which is typical of the more than 3000 
15 species of the Brassicaceae , produces fruit in which the 
two carpel valves (ovary walls) are joined to the replum, 
a visible suture that divides the two carpels. The 
structure of an Arabidopsls gynoecium around the time of 
pollination, including the carpel valves. and replum, is 
20 shown in Figure 1. 

Pod dehiscence or shatter occurs late in fruit 
development in a wide spectrum of important plant crops 
such as oilseed rape (Brassica napus L.) and is a process 
of economic importance that can lead to significant 

25 losses in seed yield. In oilseed rape, dehiscence 
involves the breakdown of cell wall material in a 
discrete cell layer known as the "dehiscence zone," which 
is a region of only one to three cells in width that 
extends along the entire length of the valve/replum 

30 boundary (Meakin and Roberts, J. Exp. Botany 41:995-1002 
(1990) ) . As the cells in the dehiscence zone separate 
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from one another, the valves detach from the replum, 

allowing seeds to be dispersed (see Figure 2). 



The plant hormone ethylene is produced by 
developing seeds and appears to be an important regulator 
5 of the dehiscence process. One line of evidence 
supporting a role for ethylene in regulation of 
dehiscence comes from studies of fruit ripening, which, 
like fruit dehiscence, is a process involving the 
breakdown of cell wall material. In fruit ripening, 

10 ethylene acts in part by activating cell wall degrading 
enzymes such as polygalacturonase (Theologis et al*. 
Develop. Genet ics 1^:282-295 (1993)). Moreover, in 
genetically modified tomato plants in which the ethylene 
response is blocked, such as transgenic tomato plants 

15 expressing antisense polygalacturonase, there is a 

significant delay in fruit ripening (Lanahan et al.. The . 
Plant Cell 6:521-530 (1994); Smith et al.. Nature 
334 :724-726 (1988) ) . 



In dehiscence, ult rastructural cha.nges that 
20 culminate in degradation of the middle lamella of 

dehiscence zone cell walls weaken rapeseed pods and 
eventually lead to pod shatter. As in fruit ripening, 
hydrolytic enzymes including polygalacturonases play a 
role in this programmed breakdown. For example, in 
25 oilseed rape, a specific endo-polygalacturonase, RDPGl, 
is upregulated and expressed exclusively in the 
dehiscence zone late in pod development (Petersen et al.. 
Plant Mol. Biol. 31:517-527 (1996), which is incorporated 
herein by reference) . Ethylene may regulate the activity 
30 of hydrolytic enzymes involved in the process of 

dehiscence as it does in fruit ripening (Meakin and 
Roberts, J. Exo. Botany 41:1003-1011 (1990), which is 
incorporated herein by reference) . Yet, until now, the 
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proteins that control the process of dehiscence, such as 
those regulating the relevant hydrolytic enzymes, have 
eluded identification. 



The present invention is directed to the 
5 surprising discovery that the AGL8 transcription factor 
regulates the process of dehiscence. As disclosed 
herein, Arabldopsls plants were transformed with an AGL8 
cDNA under control of a 35S cauliflower mosaic virus 
(CaMV) constitutive promoter such that AGL8 was 

10 ectopically expressed throughout the transformed plant. 
In particular, AGL8 , which is normally expressed in the 
carpel valves, was ectopically expressed in the replum, 
which is a small strip of cells separating the two valves 
in a mature fruit. As a consequence of such ectopic 

15 expression, the replum of the fruit was absent, with the 
cells of the outer replum replaced by cells having 
characteristics of valve identity, demonstrating thar, in 
this context, AGL8 expression is sufficient to specify 
valve cell fate. Furthermore, ectopic expression of the 

20 AGL8 cDNA piroduced a transgenic plant in which the 

dehiscence zone failed to develop normally, resulting in 
delayed seed dispersal (see Example I) . Whereas wild 
type Arabldopsls produced fruit that opened and released 
seeds on or about 1^ days after pollination, transformed 

25 Arabldopsls ectopically expressing AGL8 produced fruit in 
which seed dispersal was postponed, or in which the seeds 
were never released unless the fruit was opened manually 
(see Figure 3) . Thus, for the first time, seed plants 
were genetically modified to delay the natural process of 

30 dehiscence. 



The present invention also relates to the 
surprising discovery that an agiJ agl5 double mutant seed 
plant has a delayed seed dispersal phenotype that is 
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Strikingly similar to the AGL8 gain-of -function 
phenotype. As disclosed herein, loss-of -function 
mutations in the AGLl and AGL5 genes were produced by 
disruptive T-DNA insertion and homologous recombination 
5 (see Example II). In the resulting agJi agl5 double 
mutant plants, the dehiscence zone failed to develop 
normally, and the mature fruits did not undergo 
dehiscence (see Figure 5). Thus, AGLl or AGL5 gene 
expression is required for development of the dehiscence 
10 zone. These results indicate that AGLl , AGL5 and AGL8 
regulate pod dehiscence and that manipulation of AGLl, 
AGL5 and AGL8 expression can allow the process of pod 
shatter to be controlled. 

Thus, the present invention provides a 
non-nat urally occurring seed plant that is characterized 
by delayed seed dispersal due to ectopic expression of a 
nucleic acid molecule encoding an AGL8-like gene product. 
The AGL6-like gene product can have, for example, 
substantially the amino acid sequence of an AGL8 ortholog 
such Arabidopsis AGL8 (SEQ ID NO:2). 

As used herein, the term "non-naturally 
occurring," when used in reference to a seed plant, means 
a seed plant that has been genetically modified by man. 
A transgenic seed plant of the invention, for example, is 
25 a non-naturally occurring seed plant that contains an 
exogenous nucleic acid molecule encoding an AGL8-like 
gene product and, therefore, has been genetically 
modified by man. In addition, a seed plant that 
contains, for example, a mutation in an endogenous 
30 AGL8-like gene product regulatory element or coding 
sequence as a result of calculated exposure to a 
mutagenic agent, such as a chemical mutagen, or an 
"insertional mutagen, " such as a transposon, also is 
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considered a non-naturally occurring seed plant, since it 
has been genetically modified by man. In contrast, a 
seed plant containing only spontaneous or naturally 
occurring mutations is not a "non-naturally occurring 
5 seed plant" as defined herein and, therefore, is not 

encompassed within the invention. One skilled in the art 
'understands that, while a non-naturally occurring seed 
plant typically has a nucleotide sequence that is altered 
as compared to a naturally occurring seed plant, a 
10 non-naturally occurring seed plant also can be 

genetically modified by man without altering its 
nucleotide sequence, for example, by modifying its 
met hy la t ion pattern . 



The term "ectopically , " as used herein in 

15 reference ro expression of a nucleic acid molecule 
encoding an AGL8-like gene product, refers to an 
expression pattern that is distinct from the expression 
pattern in a wild type seed plant. Thus, one skilled in 
the art understands that ectopic expression of a nucleic 

20 acid encoding an AGL8-like gene product can refer to 

expression in a cell type other than a cell type in which 
the nucleic acid molecule normally is expressed, or at a 
time other than a time at which the nucleic acid molecule 
normally is expressed, or at a level other than the level 

25 at which the nucleic acid molecule normally is expressed. 
In wild type Arabidopsis , for example, AGL8 expression is 
normally restricted during the later stages of floral 
development to the carpel valves and is not seen in the 
replum, which is the small strip of cells separating the 

30 carpel valves. However, under control of a constitutive 
promoter such as the cauliflower mosaic virus 35S 
promoter, AGL8 is expressed in the replum and, 
additionally, is expressed at higher than normal levels 



BNSDOCID <W 9SO0502A1 t 



wo 99/00502 PCT/US98/13208 

13 

in other tissues such as valve margin and, thus^ is 

ectopically expressed. 

The term "delayed, " as used herein in reference 
5 to the timing of seed dispersal in a fruit produced by a 
non-naturally occurring seed plant of the invention, 
means a significantly later time of seed dispersal as 
compared to the time seeds normally are dispersed from a 
corresponding seed plant lacking an ectopically expressed 

10 nucleic acid molecule encoding an AGL8-like gene product. 
Thus, the term "delayed" is used broadly to encompass 
both seed dispersal that is significantly postponed as 
compared to the seed dispersal in a corresponding seed 
plant, and to seed dispersal that is completely 

15 precluded, such that fruits never release their seeds 
unless there is human or other intervention. 

It is recognized that there can be natural 
variation of the time of seed dispersal within a seed 
plant species or variety. However, a "delay" in the time 

20 of seed dispersal in a non-na t ura lly occurring seed plant 
of the invention readily can be* identified by sampling a 
population of the non-naturally occurring seed plants and 
determining that the normal distribution of seed 
dispersal times is significantly later, on average, than 

25 the normal distribution of seed dispersal times in a 
population of the corresponding seed plant species or 
variety that does not contain an ectopically expressed 
nucleic acid molecule encoding an AGL8-like gene product. 
Thus, production of non-naturally occurring seed plants 

30 of the invention provides a means to skew the normal 
distribution of the time of seed dispersal from 
pollination, such that seeds are dispersed, on average, 
at least about 1%, 2%, 5%, 10%, 30%, 50% or 100% later 
than in the corresponding seed plant species that does 
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not contain an ectopically expressed nucleic acid 
molecule encoding an AGL8-like gene product. 

A delay in seed dispersal of even one to two 
days can be valuable in increasing the amount of seed 
5 successfully harvested from a seed plant. In canola 

rapeseed, for example, dehiscence normally occurs about 8 
weeks post-pollination. In a non-naturally occurring 
canola rapeseed that ectopically expresses an AGL8-like 
gene product, dehiscence can occur one to two days later 
10 than in the wild type variety, allowing a significantly 
greater percentage of the seed crop to be harvested 
rather than lost through uncontrolled seed dispersal. 

The present invention relates to the use of 

15 nucleic acid molecules encoding particular "AGAMOUS-LIKE" 
or "AGL" gene products. AGAMOUS (AG) is a floral organ 
identity gene, one of a related family of transcription 
factors that, in various combinations, specify the 
identity of the floral organs: the petals, sepals, 

20 stamens and carpels (Bowman et al., Devel . 112:1-20 

(1991); Weigel and Meyerowitz, Cell 78:203-209 (1994); 
Yanofsky, Annual Rev. Plant Physiol. Mol . Biol. 
46:167-188 (1995)). The AGAMOUS gene product is 
essential for specification of carpel and stamen identity 

25 (Bowman et al.. The Plant Cell 1:37-52 (1989); Yanofsky 
et al,. Nature 346:35-39 (1990)). Related genes have 
recently been identified and denoted "AGAMOUS-LIKE" or 
"AGL" genes (Ma et al.. Genes Devel. 5:484-495 (1991); 
Mandel and Yanofsky, The Plant Cell 7:1763-1771 (1995), 

30 which is incorporated herein by reference) . 

AGL8, like AGAMOUS and other AGL genes, is 
characterized, in part, in that it is a plant MADS box 
gene. The plant MADS box genes generally encode proteins 
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of about 260 amino acids including a highly conserved 
MADS domain of about 56 amino acids (Riechmann and 
Meyerowitz, Biol. Chem. 378:1079-1101 (1997), which is 
incorporated herein by reference) . The MADS domain, 
5 which was first identified in the Arabidopsis AGAMOUS and 
Antirrhimuin ma jus DEFICIENS genes, is conserved among- 
transcription factors found in humans (serum response 
factor; SRF) and yeast (MCMl; Norman et al., Cell 
55:989-1003 (1988); Passmore et al., J. Mol. Biol. 

10 204:593-606 (1988), and is the most highly conserved 

region of the MADS domain proteins. The MADS domain is 
the major determinant of sequence specific DNA-binding 
activity and can also perform dimerization and other 
accessory functions (Huang et al.. The Plant 

15 Cell 8:81-94 (1996)), The MADS domain frequently resides 
at the N-terminus, although some proteins contain 
additional residues N-terminal to the MADS domain. 



The "intervening domain" or "I-domain," located 
immediately C-terminal to the MADS domain, is a weakly 

20 conserved domain having a variable length of 

approximately 30 amino acids (Purugganan et al . , Genet ics 
140:345-356 (1995)). In some proteins, the I-domain 
plays a role in the formation of DNA-binding dimers. A 
third domain present in plant MADS domain proteins is a 

25 moderately conserved 70 amino acid region denoted the 
"keratin-like domain" or "K-domain." Named for its 
similarity to regions of the keratin molecule, the 
structure of the K-domain appears capable of forming 
amphipathic helices and may mediate protein-protein 

30 interactions (Ma et al.. Genes Devel. 5:484-495 (1991)). 
The most variable domain, both in sequence and in length, 
is the carboxy-terminal or "C-domain" of the MADS domain 
proteins. Dispensable for DMA binding and protein 
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dimeri zation in some MADS domain proteins^ the function 
of this C-domain remains unknown. 

Arabidopsis AGL8 is a 242 amino acid MADS box 
5 protein (see Figure 6; SEQ ID NO: 2; Mandel and Yanofsky, 
supra ^ 1995) . The AGL8 MADS domain resides at amino 
acids 2 to 56 of SEQ ID NO: 2. The K-domain of AGL8 
resides at amino acids 92 to 158 of SEQ ID N0:2. 

In wild-type Arabidopsis, AGL8 RNA accumulates 

10 in two distinct phases, the first occurring during 

inflorescence development in the stem and cauline leaves 
and the second in the later stages of flower development 
(Mandel and Yanofsky, supra, 1995) . In particular, AGL8 
RNA is first detected in the inflorescence meristem as 

15 soon as the plant switches from vegetative to 

reproductive development- As the inflorescence stem 
elongates, AGLS RNA accumulates in the inflorescence 
meristem and in the stem. Secondly, although AGLS is not 
detected in the initial stages (1 and 2) of flower 

20 development, AGLS expression resumes at approximately 
stage 3 in the center of the floral dome in the region 
corresponding to the fourth (carpel) whorl. AGLS 
expression is excluded from all other primordia and the 
pedicel. The time of AGLS expression in the fourth 

25 carpel whorl generally corresponds to the time at which 

the organ identity genes APETALA3 , PISTILLATA AND AGAMOUS 
begin to be expressed (Yanofsky et al.. Nature 346:35-39 
(1990); Drews et al., Cell 65:991-1002 (1991); Jack et 
al., Ce3 1 68:683-697 (1992); Goto and Meyerowitz, Genes 

30 Devel . 8:1548-1560 (1994)). At later stages, AGLS 

expression becomes localized to the carpel walls, in the 
region that constitutes the valves of the ovary, and is 
absent from nearly all other cell types of the carpel. 
No AGLS RNA expression is detected in the ovules. 
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stigmatic tissues or the septum that divides the ovary. 
Thus, in nature, AGL8 expression during the later stages 
of floral development is restricted to the valves of the 
carpels and to the cells within the style. 



5 As used herein, the term "AGL8-like gene 

product" means a gene product that has the same or 
similar function as Arabidopsis AGL8 such that, when 
ectopically expressed in a seed plant, the normal 
development of the dehiscence zone is altered, and seed 

10 dispersal is delayed. An AGL8-like gene product can 

have, for example, the ability to convert, cells of the 
outer replum to a valve cell identity. Arabidopsis AGL8 
(SEQ ID NO: 2) is an example of an AGL8-like gene product 
as defined herein. As disclosed in Example 1, ectopic 

15 expression of Arabidopsis AGL8 (SEQ ID NO: 2) under 
control of a tandem CaMV 35S promoter, in which the 
intrinsic promoter element has been duplicated, alters 
formation of the dehiscence zone, thereby resulting in 
fruit characterized by a complete lack of seed dispersal. 

20 An AGL8-like gene product also can be characterized, in 
part, by its ability to interact with AGLl and, 
additionally, its ability to interact with AGL5. 



An AGL8-like gene product generally is 
characterized, in part, by having an amino acid sequence 

25 that has at least about 50% amino acid identity with the 
amino acid sequence of Arabidopsis AGL8 (SEQ ID NO: 2) . 
An AGL8-like gene product can have, for example, an amino 
acid sequence with greater than about 65% amino acid 
sequence identity with Arabidopsis AGL8 (SEQ ID N0:2), 

30 preferably greater than about 75% amino acid identity 
with Arabidopsis AGL8 (SEQ ID N0:2), more preferably 
greater than about 85% amino acid identity with 
Arabidopsis AGL8 (SEQ ID N0:2), and can be a sequence 
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having greater than about 90%, 95% or 97% amino acid 
identity with Arabidopsis AGL8 (SEQ ID N0:2). 

Preferably, an AGL8-like gene product is 
orthologous to the seed plant species in which it is 
ectopically expressed. A nucleic acid molecule encoding 
Arabidopsis AGL8 (SEQ ID NO: 2), for example, can be 
ectopically expressed in an Arabidopsis plant to produce 
a non-naturally occurring Arabidopsis variety 
characterized by delayed seed dispersal. Similarly, a 
nucleic acid molecule encoding canola AGL8 can be 
ectopically expressed in a canola plant to produce a 
non-naturally occurring canola variety characterised by 
delayed seed dispersal. 

15 A nucleic acid molecule encoding an AGL8-like 

gene product also can be ectopically expressed in a 
heterologous seed plant to produce a non-naturally 
occurring seed plant characterized by delayed seed 
dispersal. AGAMOUS-like gene products have been widely 

20 conserved throughout the plant kingdom; for example, 
AGAMOUS has been conserved in tomato (TAGl) and maize 
(ZAGl), indicating that orthologs of AGAMOUS-like genes 
are present in most, if not all, angiosperms (Pnueli et 
al.. The Plant Cell 6:163-173 (1994); Schmidt et al.. The 

25 Plant Cell 5:729-737 (1993)). AGL8-like gene products 
such as AGL8 orthologs also can be conserved and can 
function across species boundaries to delay seed 
dispersal. Thus, ectopic expression of a nucleic acid 
molecule encoding Arabidopsis AGL8 (SEQ ID N0:2) in a 

30 heterologous seed plant within the Brassicaceae such as 
Brassica napus L. (rapeseed) or within the Fabaceae such 
as in Glycine (soybean) can alter normal development of 
the dehiscence zone, thereby resulting in delayed seed 
dispersal. Furthermore, a nucleic acid molecule encoding 
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Arahldopsis AGL8 (SEQ ID N0:2), for example, can be 
ectopically expressed in more distantly related 
heterologous seed plants, including dehiscent seed plants 
as well as other dicotyledonous and monocotyledonous 
5 angiosperms and gymnosperms and, upon ectopic expression, 
can alter normal development of the dehiscence zone and 
delay seed dispersal in the heterologous seed plant. 

As used herein, the term "AGL8-like gene 
product" encompasses an active segment of an AGL8-like 

10 gene product, which is a polypeptide portion of an 

AGLB-like gene product that, when ectopically expressed, 
alters normal development of the dehiscence zone and 
delays seed dispersal. An active segment can be, for 
example, an amino terminal, internal or carboxy terminal 

15 fragment of Arahldopsis AGL8 (SEQ ID NO: 2) that, when 
ectopically expressed in a seed plant, alters normal 
development of the dehiscence zone and delays seed 
dispersal. An active segment of an AGL8-like gene 
product can include, for example, the MADS domain and can 

20 have the ability to bind DNA specifically. The skilled 
artisan will recognize that a nucleic acid molecule 
encoding an active segment of an AGL8-like gene product 
can be useful in producing a seed plant of the invention 
characterized by delayed seed dispersal and in the 

25 related methods and kits of the invention described 
further below. 

An active segment of an AGL8-like gene product 
can be identified using the methods described in 
Example I or using other routine methodology. Briefly, a 
30 seed plant such as Arabidopsis can be transformed with a 
nucleic acid molecule under control of a constitutive 
regulatory element such as a tandem CaMV 35S promoter. 
Phenotypic analysis of the seed plant reveals whether a 
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seed plant ectopically expressing a particular 
polypeptide portion is characterized by delayed seed 
dispersal. In transgenic plants in which seed dispersal 
is delayed, further analysis can be performed to confirm 
5 that normal development of the dehiscence zone has been 
altered. For analysis of a large number of polypeptide 
portions of an AGL8-like gene product, nucleic acid 
molecules encoding the polypeptide portions can be 
assayed in pools, and active pools subsequently 
10 subdivided to identify the active nucleic acid molecule. 



In one embodiment, the invention provides a 
non-naturally occurring seed plant that is characterized 
by delayed seed dispersal due to ectopic expression of a 
nucleic acid molecule encoding an AGL8-like gene product 

15 having substantially the amino acid sequence of an AGL8 

ortholog. As used herein, the term "AGL8 ortholog" means 
an ortholog of Arabidopsis AGL8 (SEQ ID N0:2) and refers 
to an AGL8-like gene product that, in a particular seed 
plant variety, has the highest percentage homology at the 

20 amino acid level to Arabidopsis AGL8 (SEQ ID N0:2). An 
AGL8 ortholog can be, for example, a Brasslca AGL8 
ortholog such as a Brasslca napus AGL8 ortholog, or a 
Fabacea AGL8 ortholog such as a soybean, pea, lentil, or 
bean AGL8 ortholog. An AGL8 ortholog from the long-day 

25 plant Slnapls alba, designated SaMADS B, has been 

described (Menzel et al.. Plant J. 9:399-408 (1996), 
which is incorporated herein by reference) . Novel AGL8 
ortholog cDNAs can be isolated from additional seed plant 
species using a nucleotide sequence as a probe and 

30 methods well known in the art of molecular biology (Click 
and Thompson (eds, ) , Methods in Plant Molecular Biology 
and Biotechnology , Boca Raton, FL: CRC Press (1993) ; 
Sambrook et al. (eds,)/ Molecular Cloning: A Laboratory 
Manual (Second Edition), Plainview, NY: Cold Spring 
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Harbor Laboratory Press (1989), each of which is 
incorporated herein by reference) . 

As used herein, the term "substantially the 
amino acid sequence," when used in reference to an AGL8 
5 ortholog, is intended to mean a polypeptide or 

polypeptide segment having an identical amino acid 
sequence, or a polypeptide or polypeptide segment having 
a similar, non-identical sequence that is considered by 
those skilled in the art to be a functionally equivalent 

10 amino acid sequence. For example, an AGL8-like gene 

product having substantially the amino acid sequence of 
Arabidopsis AGLB can have an amino acid sequence 
identical to the sequence of Arabidopsis AGL8 (SEQ ID 
NO: 2) shown in Figure 6, or a similar, ^ noo-ident ical 

15 sequence that is functionally equivalent. In particular, 
an amino acid sequence that is "substantially the amino 
acid sequence" of AGL8 can have one or more modifications 
such as amino acid additions, deletions or substitutions, 
relative to the AGLB amino acid sequence shown (SEQ ID 

20 N0:2), provided that the modified polypeptide retains 

substantially the ability to alter normal development of 
the dehiscence zone and delay seed dispersal when 
ectopically expressed in the seed plant. ' Comparison of 
sequences for substantial similarity can be performed 

25 between two sequences of any length and usually is 
performed with sequences between about 6 and 1200 
residues, preferably between about 10 and 100 residues 
and more preferably between about 25 and 35 residues. 
Such comparisons for substantial similarity are performed 

30 using methodology routine in the art. 

It is understood that minor modifications of 
primary amino acid sequence can result in an AGLB-like 
gene product that has substantially equivalent or 
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enhanced function as compared to the AGL8 ortholog from 
which it was derived. Further, various molecules can be 
attached to an AGL8 ortholog or active segment thereof, 
for example, other polypeptides, antigenic or other 
5 peptide tags, carbohydrates, lipids, or chemical 

moieties. Such modifications are included within the 
term AGL8 ortholog as defined herein. 



One or more point mutations can be introduced 
into a nucleic acid molecule encoding an AGL8 ortholog to 

10 yield a modified nucleic acid molecule using, for 

example, site-directed mutagenesis (see Wu (Ed.), Meth . 
In En7.vmol. Vol. 217, San Diego: Academic Press (1993); 
Higuchi, "Recombinant PGR" in Innis et al. (Ed.), PGR 
Protocols . San Diego: Academic Press, Inc. (1990), each 

15 of which is incorporated herein by reference) . Such 

mutagenesis can be used to introduce a specific, desired 
amino acid insertion, deletion or substitution; 
alternatively, a nucleic acid sequence can be synthesized 
having random nucleotides at one or more predetermined 

20 positions to generate random amino acid substitutions. 
Scanning mutagenesis also can be useful in generating a 
modified nucleic acid molecule encoding substantially the 
amino acid sequence of an AGL8 ortholog. 

Modified nucleic acid molecules can be 
25 routinely assayed for the ability to alter normal 

development of the dehiscence zone and to delay seed 
dispersal. In the same manner as described in Examples I 
and III, a nucleic acid molecule encoding substantially 
the amino acid sequence of an AGL8 ortholog can be 
30 ectopically expressed, for example, using a constitutive 
regulatory element such as the CaMV 35S promoter or using 
a dehiscence zone-selective regulatory element such as 
the AGLl promoter. If such ectopic expression results in 
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a seed plant in which the dehiscence zone fails to 
develop and in which seed dispersal is delayed, the 
modified polypeptide or segment is an **AGL8 ortholog" as 
defined herein. 

5 A non-naturally occurring seed plant of the 

invention that is characterized by delayed seed dispersal 
can be one of a variety of seed plant species, such as a 
dehiscent seed plant or another monocotyledonous and 
dicotyledonous angiosperm or gymnosperm. A useful seed 
10 plant of the invention can be a dehiscent seed plant, and 
a particularly useful seed plant of the invention can be 
a member of the Brasslcaceae, such as rapeseed, or a 
member of the FabaceaB, such as a soybean, pea, lentil or 
bean plant. 

15 As used herein, the term "seed plant" means an 

angiospcrm or gymnosperm. An angiosperm is a 
seed-bearing plant whose seeds are borne in a mature 
ovary (fruit). An angiosperm commonly is recognized as a 
flowering plant. Angiosperms are divided into two broad 

20 classes based on the number of cotyledons, which are seed 
leaves that generally store or absorb food. Thus, a 
monocotyledonous angiosperm is an angiosperm having a 
single cotyledon, whereas a dicotyledonous angiosperm is 
an angiosperm having two cotyledons. A variety of 

25 angiosperms are known including, for example, oilseed 
plants, leguminous plants, fruit-bearing plants, 
ornamental flowers, cereal plants and hardwood trees, 
which general classes are not necessarily exclusive. The 
skilled artisan will recognize that the methods of the 

30 invention can be practiced using these or other 

angiosperms, as desired. A gymnosperm is a seed-bearing 
plant with seeds not enclosed in an ovary. 
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In one embodiment/ the invention provides a 
non-naturally occurring dehiscent seed plant that is 
characterized by delayed seed dispersal due to ectopic 
expression of a nucleic acid molecule encoding an 
5 AGLB-like gene product in the dehiscent seed plant. As 
used herein, the term "dehiscent seed plant" means a seed 
plant that produces a dry dehiscent fruit, which has 
fruit walls that open to permit escape of the seeds 
contained therein. Dehiscent fruits commonly contain 
10 several seeds and include the fruits known, for example, 
as legumes, capsules and siliques. 

In one embodiment, the invention provides a 
non-naturally occurring seed plant that is characterized 
by delayed seed dispersal due to ectopic expression of a 

15 nucleic acid molecule encoding an AGL8-like gene product, 
where the seed plant is a member of the Brasslcaceae . 
The Brasslcaceae, commonly known as the Brassicas, are a 
diverse group of crop plants with great economic value 
worldwide (see, for example, Williams and Hill, Science 

20 232:1385-1389 (1986), which is incorporated herein by 
reference). The Brasslcaceae produce seed oils for 
margarine, salad oil, cooking oil, plastic and industrial 
uses; condiment mustard; leafy, stored, processed and 
pickled vegetables; animal fodders and green manures for 

25 soil rejuvenation. A particularly useful non-naturally 
occurring Brassica seed plant of the invention is the 
oilseed plant canola. 

There are six major Brassica species of 
economic importance, each containing a range of plant* 
30 forms. Brassica napus includes plants such as the 

oilseed rapes and rutabaga. Brassica oleracea are the 
cole crops such as cabbage, cauliflower, kale, kohlrabi 
and Brussels sprouts. Brassica campestrls (Brassica 
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rapa) includes plants such as Chinese cabbage, turnip and 
pak Choi. Brassica juncea includes a variety of 
mustards; Brassica nigra is the black mustard; and 
Brassica carinata is Ethiopian mustard. The skilled 
5 artisan understands that any member of the Brassicaceae 
can be modified as disclosed herein to produce a 
non-naturally occurring Brassica plant characterized by 
delayed seed dispersal. 

In a second embodiment/ the invention provides 

10 a non-naturally occurring seed plant that is 

characterized by delayed seed dispersal due to ectopic 
expression of a nucleic acid molecule encoding an 
AGL8-like gene product, where the seed plant is a member 
of the Fabaceae. The Fabaceae, which are commonly known 

15 as members of the pea family, are seed plants' that 

produce a characteristic dry dehiscent fruit known as a 
legume. The legume is derived from a single carpel and 
dehisces along the suture of the carpel margins and along 
the median vein. The Fabaceae encompass both grain 

20 legumes and forage legumes. Grain legumes include, for 
example, soybean (glycine) , pea, chickpea, moth bean, 
broad bean, kidney bean, lima bean, lentil, cowpea, dry 
bean and peanut. Forage legumes include alfalfa, 
lucerne, birdsfoot trefoil, clover, styiosanthes species, 

25 lotononis bainessii and sainfoin. The skilled artisan 
will recognize that any member of the Fabaceae can be 
modified as disclosed herein to produce a non-naturally 
occurring seed plant of the invention characterized by 
delayed seed dispersal. 

30 A non-naturally occurring seed plant of the 

invention characterized by delayed seed dispersal also 
can be a member of the plant genus Cuphea (family 
Lythraceae) . A Cuphea seed plant is particularly 
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valuable since Cuphea oilseeds contain industrially and 
nutritionally important medium-chain fatty acids, 
especially lauric acid, which is currently supplied only 
by coconut and palm kernel oils. 

5 A non-naturally occurring seed plant of the 

invention also can be, for example, one of the 
monocotyledonous grasses, which produce many of the 
valuable small-grain cereal crops of the world. In a 
non-naturally occurring small grain cereal plant of the 

10 invention, grain remains on the seed plant longer and. 

Ectopic expression of a nucleic acid molecule encoding an 
AGL8-like gene product, or suppression of AGLl and AGL5 
expression as described below, can be useful in 
generating a non-naturally occurring small grain cereal 

15 plant, such as a barley, wheat, oat, rye, orchard grass, 
guinea grass, sorghum or turf grass plant characterized 
by delayed seed dispersal. 

The invention also provides a transgenic seed 
plant that is characterized by delayed seed dispersal due 

20 to ectopic expression of a nucleic acid molecule encoding 
an AGL8-like gene product. In a transgenic seed plant of 
the invention, the ectopically expressed nucleic acid 
molecule encoding an AGL8-like gene product can be 
operatively linked to an exogenous regulatory element. 

25 The invention provides, for example, a transgenic seed 
plant characterized by delayed seed dispersal having an 
ectopically expressed nucleic acid molecule encoding an 
AGL8-like gene product that is operatively linked to an 
exogenous constitutive regulatory element. In one 

30 embodiment, the invention provides a transgenic seed 
plant that is characterized by delayed seed dispersal 
due to ectopic expression of an exogenous nucleic acid 
molecule encoding substantially the amino acid sequence 
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of an AGL8 ortholog operatively linked to an exogenous 
cauliflower mosaic virus 35S promoter. 

The invention also provides a transgenic seed 
plant that is characterized by delayed seed dispersal 
5 due to ectopic expression of a nucleic acid molecule 

encoding an AGL8-like gene product operatively linked to 
a dehiscence zone-selective regulatory element. The 
dehiscence zone-selective regulatory element can be, for 
example, an AGLl regulatory element or AGL5 regulatory 

10 element. The AGLl regulatory element can be derived from 
the Arabldopsis AGLl genomic sequence disclosed herein as- 
5EQ ID NO: 3 and can be, for example, a 5' regulatory 
sequence or intronic regulatory element. Similarly, the 
AGL5 regulatory element can be derived from the 

15 Arabldopsis AGL5 genomic sequence disclosed herein as SEQ 
ID NO: 4 and can be, for example, a 5* regulatory sequence 
or intronic regulatory element. 

In one embodiment, a transgenic seed plant of 
the invention has an ectopically expressed exogenous 

20 nucleic acid molecule encoding substantially the amino 

acid sequence of an AGL8 ortholog operatively linked to a 
dehiscence zone-selective regulatory element that is an 
AGLl regulatory element having at least fifteen 
contiguous nucleotides of nucleotides 1 to 2599 of SEQ ID 

25 N0:3; nucleotides 2833 to 4128 of SEQ ID N0:3; 

nucleotides 4211 to 4363 of SEQ ID NO:3; nucleotides 4426 
to 4554 of SEQ ID NO: 3; nucleotides 4796 to 4878 of SEQ 
ID NO: 3; nucleotides 4921 to 5028 of SEQ ID NO: 3; or 
nucleotides 5421 to 5682 of SEQ ID N0:3. 

30 In another embodiment, a transgenic seed plant 

of the invention has an ectopically expressed exogenous 
nucleic acid molecule encoding substantially the amino 



BNSDOCID -:WO 9900502A1 I > 



wo 99/00502 PCTAJS98/13208 

28 

acid sequence of an AGL8 ortholog opezatively linked to a 
dehiscence zone-selective regulatory element that is an 
AGL5 regulatory element having at least fifteen 
contiguous nucleotides of nucleotides 1 to 1890 of SEQ ID 
5 NO: 4; nucleotides 2536 to 2683 of SEQ ID N0:4; 

nucleotides 2928 to 5002 of SEQ ID NO: 4; nucleotides 5085 . 
to 5204 of SEQ ID NO: 4; nucleotides 5367 to 5453 of SEQ 
ID N0:4; nucleotides 5645 to 5734 of SEQ ID N0:4; or 
nucleotides 6062 to 6138 of SEQ ID N0:4. 

10 As used herein, the term "transgenic" refers to 

a seed plant that contains an exogenous nucleic acid 
molecule, which can be derived from the same seed plant 
species or a heterologous seed plant species. 

The term "exogenous," as used herein in 
reference to a nucleic acid molecule and a transgenic 
seed plant, means a nucleic acid molecule originating 
from outside the seed plant. An exogenous nucleic acid 
molecule can be, for example, a nucleic acid molecule 
encoding an AGLS-like gene product or an exogenous 
regulatory element such as a constitutive regulatory 
element or a dehiscence zone-selective regulatory 
element, as described further below. An exogenous 
nucleic acid molecule can have ' a naturally occurring or 
non-naturally occurring nucleotide sequence and can be a 
heterologous nucleic acid molecule derived from a 
different seed plant species than the seed plant into 
which the nucleic acid molecule is introduced or can be a 
nucleic acid molecule derived from the same seed plant 
species as the seed plant into which it is introduced. 

The term "operatively linked, " as used in 
reference to a regulatory element and a nucleic acid 
molecule, means that the regulatory element confers 
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regulated expression upon the operatively linked nucleic 
acid molecule. Thus, the term "operatively linked," as 
used in reference to an exogenous regulatory element such 
as a dehiscence zone-selective regulatory element and a 
5 nucleic acid molecule encoding an AGL8-like gene product, 
means that the dehiscence zone-selective regulatory 
element is linked to the nucleic acid molecule encoding 
an AGL8-like gene product such that the expression 
pattern of the dehiscence zone-selective regulatory 

10 element is conferred upon the nucleic acid molecule 

encoding the AGL8-like gene product. It is recognized 
that a regulatory element and a nucleic acid molecule 
that are operatively linked have, at a minimum, all 
elements essential for transcription, including, for 

15 example, a TATA box. 

As used herein, the term "constitutive 
regulatory element*' means a regulatory element that 
confers a level of expression upon an operatively linked 
nucleic molecule that is relatively independent of the 
20 cell or tissue type in which the constitutive regulatory 
element is expressed. A constitutive regulatory element 
that is expressed in a seed plant generally is widely 
expressed in a large number of cell and tissue types. 



25 A variety of constitutive regulatory elements 

useful for ectopic expression in a transgenic seed plant 
are well known in the art. The cauliflower mosaic 
virus 35S (CaMV 35S) promoter, for example, is a 
well-characterized constitutive regulatory element that 

30 produces a high level of expression in all plant tissues 
(Odell et al.. Nature 313:810-812 (1985)). . The CaMV 35S 
promoter can be particularly useful due to its activity 
in numerous diverse seed plant species (Benfey and Chua, 
Science 250:959-966 (1990); Futterer et al., Physiol . 
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Plant 79:154 (1990); Odell et al., supra, 1985). A 
tandem 35S promoter, in which the intrinsic promoter 
element has been duplicated, confers higher expression 
levels in comparison to the unmodified 35S promoter (Kay 
5 et al.. Science 236:1299 (1987)). Other constitutive 
regulatory elements useful for ectopically expressing a 
nucleic acid molecule encoding an AGL8-like gene product 
in a transgenic seed plant of the invention include, for 
example, the cauliflower mosaic virus 19S promoter; the 
10 Figwort mosaic virus promoter; and the nopaline synthase 
(nos) gene promoter (Singer et al.^ Plant Mol . 
Bi ol . 14:^ 33 (1990); An, Plant Physiol. 81: 86 (1986)). 



Additional constitutive regulatory elements 
including those for efficient ectopic expression in 

15 monocots also are known in the art, for example, the pEmu 
promoter and promoters based on the rice Actin-1 
5' region (Last et al., Theor. Appl . Genet. 81:581 
(1991); Mcelroy et al . , Mol . Gen. Genet, 231:150 (1991); 
Mcelroy et al.. Plant Cell 2:163 (1990)). Chimeric 

20 regulatory elements, which combine elements from 

different genes, also can be useful for ectopically 
• expressing a nucleic acid molecule encoding an AGLB-like 
gene product (Comai et al.. Plant Mol . Biol . 15:373 
(1990)). One skilled in the art understands that a 

25 particular constitutive regulatory element is chosen 
based, in part, on the seed plant species in which a 
nucleic acid molecule encoding an AGL8-like gene product 
is to be ectopically expressed and on the desired level 
of expression. 

30 An exogenous regulatory element useful in a 

transgenic seed plant of the invention also can be an 
inducible regulatory element, which is a regulatory 
element that confers conditional expression upon an 
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operatively linked nucleic acid molecule, where 
expression of the operatively linked nucleic acid 
molecule is increased in the presence of a particular 
inducing agent or stimulus as compared to expression of 
5 the nucleic acid molecule in the absence of the inducing 
agent or stimulus. Particularly useful inducible 
regulatory elements include copper-inducible regulatory 
elements (Mett et al., Proc. Matl. Acad. Sci . 
IISA 90:4567-4571 (1993); Furst et al., Cell 55:705-717 
10 (1988)); tetracycline and chlor-tet racycline-inducible 
regulatory elements (Gatz et al . , Plant J. 2:397-404 
(1992); Roder et al,, Mol . Gen. Genet. 243:32-38 (1994); 
Gatz, Meth. Cell Biol. 50:411-424 (1995)); ecdysone 
inducible regulatory elements ( Chr istopherson et al . , 
15 Proc. Natl. Acad. Scj . USA 89:6314-6318 (1992); 

Kreutzweiser et al., Ecotoxicol. Environ. Safety 28:14-24 
(1994)); heat shock inducible regulatory elements 
(Takahashi et al.. Plant Physiol. 99:383-390 (1992); Yabe 
et al.. Plant Cell Phvsiol. 35:1207-1219 (1994); Ueda et 
20 al.. Mo] . Gen . Genet . 250:533-539 (1996)); and lac operon 
elements, which are used in combination with a 
const itutively expressed lac repressor to confer, for 
example, IPTG-inducible expression (Wilde et al., 
EMBO J. 11:1251-1259 (1992)). 

25 An inducible regulatory element useful in the 

transgenic seed plants of the invention also can be, for 
example, a ni trate-inducible promoter derived from the 
spinach nitrite reductase gene (Back et al., Plant Mol. 
Piol - 17:9 (1991)) or a light-inducible promoter, such as 

30 that associated with the small subunit of RuBP 

carboxylase or the LHCP gene families (Feinbaum et al,, 
. Mol. Ge n. Genet. 226:449 (1991); Lam and Chua, 

Science 248:471 (1990)). Additional inducible regulatory 
elements include salicylic acid inducible regulatory 
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elements (Uknes et al,. Plant Cell 5:159-169 (1993); Bi 
et al.. Plant J. 8:235-245 (1995)); plant 
hormone- inducible regulatory elements 

( Yamaguchi-Shinozaki et al.. Plant Mol. Biol. 15:905 ' 
5 (1990); Kares et al.. Plant Mol. Biol. 15:225 (1990)); 
and human hormone-inducible regulatory elements such as 
the human glucocorticoid response element (Schena et al., 
Proc. Natl. Acad. Sci . USA 88:10421 (1991)). 

It should be recognized that a non-naturally 
10 occurring seed plant of the invention, which contains an 
ectopically expressed nucleic acid molecule encoding an 
AGL8-like gene product, also can contain one or more 
additional modifications, including naturally and 
non-naturally occurring modifications, that can modulate 
15 the delay in seed dispersal. For example, the plant 

hormone ethylene promotes fruit dehiscence, and modified 
expression or activity of positive or negative regulators 
of the ethylene response can be included in a seed plant 
of the invention (see, generally, Meakin and Roberts, 
20 ExD. Botany 41:1003-1011 (1990); Ecker, Science 

268:667-675 (1995); Chao et al . , Cell 89:1133-1144 
(1997) ) . 

Mutations in positive regulators of the 
ethylene response show a reduction or absence of 

25 responsiveness to treatment with exogenous ethylene. 
Arabidopsls mutations in positive regulators of the 
ethylene response include mutations in etr, which 
inactivate a histidine kinase ethylene receptor (Bleeker 
et al., Science 241:1086-1089 (1988); Schaller and 

30 Bleeker, Science 270:1809-1811 (1995)); ers (Hua et al.. 
Science 269:1712-1714 (1995)); eln2 (Guzman and Ecker, 
Plant Cell 2:513 (1990)); ein3 (Rothenberg and Ecker, 
Sem. Dev. Biol. Plant Dev. Genet. 4:3-13 (1993); Kieber 
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and Ecker, Trends Genet. 9:356-362 (1993)); ainl (van der 
Straeten et al.. Plant Phvsiol, 102:401-408 (1993)); eti 
(Harpham et al., An. Bot . 68:55 (1991)) and ein4 , einS, 
einS, and ein7 (Roman et al.. Genetics 139: 1393-1409 
5 (1995) ) . Similar genetic functions are found in other 
seed plant species; for example, the never-ripe mutation 
corresponds to etr and confers ethylene insensit ivity in 
tomato (Lanahan et al.. The Plant Cell 6:521-530 (1994); 
Wilkinson et al.. Science 270:1807-1809 (1995)). A seed 

10 plant of the invention can include a modification that 
results in altered expression or activity of any such 
positive regulator of the ethylene response. A mutation 
in a positive regulator, for example, can be included in 
a seed plant of the invention and can modify the delay in 

15 seed dispersal in such plants, for example, by further 
postponing the delay in seed dispersal. 

Mutations in negative regulators of the 
ethylene response display ethylene responsiveness in the 
absence of exogenous ethylene. Such mutations include 

20 those relating to ethylene overproduction, for example, 
the etol , eto2 , and eto3 mutants, and those relating to 
constitutive activation of the ethylene signalling 
pathway, for example, mutations in CTRl ^ a negative 
regulator with sequence similarity to the Raf family of 

25 protein kinases (Kieber et al . , Cell 72:427-441 (1993), 

which is incorporated herein by reference) . A seed plant 
of the invention can include a modification that results 
in altered expression or activity of any such negative 
regulator of the ethylene response. A mutation resulting 

30 in ethylene responsiveness in the absence of exogenous 

ethylene, for example, can be included in a non-naturally 
occurring seed plant of the invention and can modify, for 
example, diminish, the delay in seed dispersal. 



wo 99O0502A1 f > 



wo 99/00502 PCT/US98/13208 

.34 

Fruit morphological mutations also can be 
included in a seed plant of the invention. Such 
mutations include those in carpel identity genes such as 
AGAMOUS (Bowman et al., supra, 1989; Yanofsky et al., 
5 supra, 1990) and in genes required for normal fruit 

development such as ETTIN, CRABS CLAW, SPATULA, AGL8 and 
TOUSLED (Sessions et al . , Development 121:1519-1532 
(1995); Alvarez and Smyth, Flowering Newsletter 23:12-17 
(1997); and Roe et al . , Cell 75:939-950 (1993)). Thus, 
10 it is understood that a seed plant of the invention 

having an ectopically expressed nucleic acid molecule 
encoding an AGL8-like gene product can include one or 
more additional genetic modifications, which can diminish 
or enhance the delay in seed dispersal. 

15 The present invention also provides methods of 

producing a non-naturally occurring seed plant 
characterized by delayed seed dispersal. A method of the 
invention entails ectopically expressing a nucleic acid 
molecule encoding an AGL8-like gene product in the seed 

20 plant, whereby seed dispersal is delayed due to ectopic 
expression of the nucleic acid molecule. 



As discussed above, the term "ectopically" 
refers to expression of a nucleic acid molecule encoding 
an AGL8-like gene product in a cell type other than a 

25 cell type in which the nucleic acid molecule is normally 
expressed, at a time other than a time at which the 
nucleic acid molecule is normally expressed or at n 
expression level other than the level at which the 
nucleic acid normally is expressed. In wild type 

30 Arabidopsis , for example, AGL8 expression is normally 

restricted during the later stages of floral development 
to the carpel valves and is not seen in the outer replum. 
In the methods of the invention, particularly useful 
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ectopic expression of a nucleic acid molecule encoding an 

AGL8-like gene product involves expression in the cells 
of the outer replum, which are the progenitors of the 
dehiscence zone. 

5 Actual ectopic expression of an AGL8-like gene 

product is dependent on various factors. The ectopic 
expression can be widespread expression throughout most 
or all plant tissues or can be expression restricted to a 
small number of plant tissues, and can be achieved by a 

10 variety of routine techniques. Mutagenesis, including 
seed or pollen mutagenesis, can be used to generate a 
non-naturally occurring seed plant, in which a nucleic 
acid molecule encoding an AGL8-like gene product is 
ectopically expressed. Ethylmethane sulfonate (EMS) 

15 mutagenesis, transposon mediated mutagenesis or T-DNA 
mediated mutagenesis also can be useful in ectopically 
expressing an AGLB-like gene product to produce a seed 
plant characterized by delayed seed dispersal (see, 
generally. Click and Thompson, supra, 1993). While not 

20 wishing to be bound by any particular mechanism, ectopic 
expression in a mutagenized plant can result from 
- inactivation of one or more negative regulators of AGL8, 
for example, from the combined inactivation of AGLl and 
AGL5. 



25 Ectopic expression of an AGL8-like gene product 

also can be achieved by expression of a nucleic acid 
encoding an AGL8-like gene product from a heterologous 
regulatory element or from a modified variant of its own 
promoter. Heterologous regulatory elements include 

30 constitutive regulatory elements, which result in 

expression of the AGL8-like gene product in the outer 
replum as well as in a variety of other cell types, and 
dehiscence zone-selective regulatory elements, which 
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produce selective expression of an AGL8-like gene product 
in a limited number of cell types including the cells of 
the valve margin or the dehiscence zone. 

Ectopic expression of a nucleic acid molecule 
5 encoding an AGL8-like gene product can be achieved using 
an endogenous or exogenous nucleic acid molecule encoding 
an AGL8-like gene product. A recombinant exogenous 
nucleic acid molecule can contain a heterologous 
regulatory element that is operatively linked to a 

10 nucleic acid sequence encoding an AGL8-like gene product. 
Methods for producing the desired recombinant nucleic 
acid molecule under control of a heterologous regulatory 
element and for producing a non-natural ly occurring seed 
plant of the invention are well known in the art (see, 

15 generally, Sambrook et al., supra ^ 1989'; Glick and 
Thompson, supra, 1993). 

An exogenous nucleic acid molecule can be 
introduced into a seed plant for ectopic expression using 
a variety of transformation methodologies including 
/igroba c teri u/n-mediated transformation and direct gene 
transfer methods such as electroporation and 
micropro j ect ile-mediated transformation ( see, generally, 
Wang et al . (eds) , Transformation of Plants and Soil 
Microorgani sms , Cambridge, UK: University Press (1995'), 
which is incorporated herein by reference) . 
Transformation methods based upon the soil bacterium 
Agrohacterlum tumefaciens are particularly useful for 
introducing an exogenous nucleic acid molecule into a 
seed plant. The wild type form of AgroJbacteriuin contains 
a Ti (tumor-inducing) plasmid that directs production of 
tumorigenic crown gall growth on host plants. Transfer 
of the tumor-inducing T-DNA region of the Ti plasmid to a 
plant genome requires the Ti plasmid-encoded virulence 
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mustard, and flax, and transgenic plants of the Fabaceae 
family such as soybean, pea, lentil and bean. 

Micropro j ectile-mediated transformation also 
can be used to produce a transgenic seed plant that 
5 ectopically expresses an AGL8-like gene product. This 

method, first described by Klein et al. ( Nature 327:70-73 
(1987), which is incorporated herein by reference), 
relies on micropro j ectiles such as gold or tungsten that 
are coated with the desired nucleic acid molecule by 
10 precipitation with calcium chloride, spermidine or PEG. 
The micropro jectile particles are accelerated at high 
speed into an angiosperm tissue using a device such as 
the BIOLISTIC PD-1000 (Biorad; Hercules CA) . 

Micropro jectile-mediated delivery or "particle 

15 bombardment" is especially useful to transform seed 
plants that are difficult to transform or regenerate 
using other methods. Micropro j ect i le-mediated 
transformation has been used, for example, to generate a 
variety of transgenic plant species, including cotton, 

20 tobacco, corn, hybrid poplar and papaya (see Click and 
Thompson, supra ^ 1993) as well as cereal crops such as 
wheat, oat, barley, sorghum and rice {Duan et al.. Nature 
Biotech. 14:494-498 { 1 996 ) ; Shimamoto , Curr. Opin. 
Biotech . 5:158-162 (1994), each of which is incorporated 

25 herein by reference) . In view of the above, the skilled 
artisan will recognize that ^grojbacteri ujn-mediated or 
micropro jectile-mediated transformation , as disclosed 
herein, or other methods known in the art can be used to 
introduce a nucleic acid molecule encoding an AGL8-like 

30 gene product into a seed plant for ectopic expression. 

In another embodiment, the invention provides a 
non-naturally occurring seed plant that is characterized 
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by delayed seed dispersal due to suppression of both AGLl 
expression and AGL5 expression in the seed plant. Such a 
non-natural ly occurring seed plant characterized by 
delayed seed dispersal can be, for example, an agll agl5 
5 double mutant. 

As disclosed herein, loss-of -function mutations 
in the AGLl and AGL5 genes were produced by a combination 
of homologous recombination and disruptive T-DNA 
insertion (see Example II), Neither AGLl nor AGL5 RNA 

10 was expressed in the resulting agll agl5 double mutant, 
and scanning electron microscopy revealed that the 
dehiscence zone failed to develop normally in these 
mutant seed plants. Furthermore, the mature fruits of 
these seed plants failed to undergo dehiscence, as shown 

15 in Figure 5, These results indicate that AGLl or AGL5 

gene expression is required for normal development of the 
dehiscence zone and that suppression of AGLl expression 
com.bined with suppression of AGL5 expression in the seed 
plant can delay dehiscence, allowing the process of pod 

20 shatter to be controlled. 

The y^rajbidopsis AGLl and AGL5 genes encode MADS 
box proteins with 85% identity at the amino acid level 
(see Tables 1 and 2) . The AGLl and AGL5 RNA expression 
patterns also are strikingly similar. In particular, 

25 both RNAs are specifically expressed in flowers, where 
they accumulate in developing carpels. In particular, 
strong expression of these genes is observed in the outer 
replum along the valve/replum boundary (Ma et al., supra ^ 
1991; Savidge et al.. The Plant Cell 7:721-723 (1995); 

30 Flanagan et al . , The Plant Journal 10:343-353 (1996), 
each of which is incorporated herein by reference) . 
Thus, AGLl and AGL5 are expressed in the valve margin, at 
least within the cells of the outer replum. 



BNSDOCID <:WO 9900502A1 i > 



wo 99/00502 PCT/US98/1 3208 



40 



5 



Table 1 

Amino acid identity in the MADS domain and K-domain of 

AGAMOUS, AGLl and AGL5 




AGAMOUS 


AGLl . 


AGL5 




MADS 


K 


MADS 


K 


MADS 


K 


AGAMOUS 






95% 


68% 


95% 


62% 


AGLl 










100% 


92% 


AGL5 















Table 2 

Amino acid identity in the I-domain and C-domain of 

AGAMOUS, AGLl and AGL5 




AGAMOUS 


AGLl 


AGL5 




I 


C 


I 


C 


I 


C 


AGAMOUS 














AGLl 


71% 


39% 










AGL5 


65% 


37% 


95% 


72% 







As used herein, the term "AGLl" refers to 
15 Arahidopsis AGLl (SEQ ID NO: 6) or an ortholog of 

Arabidopsis AGLl (SEQ ID NO: 6). An AGLl ortholog is a 
MADS box gene product expressed, at least in part, in the 
valve margins of a seed plant and having homology to the 
amino acid sequence of Arahidopsis AGLl (SEQ ID NO: 6). 
20 AGLl or an AGLl ortholog can function, in part, by 

forming a complex with an AGL8-like gene product. An 
AGLl ortholog generally has an amino acid sequence having 
at least about 63% amino acid identity with Arabidopsis 
AGLl (SEQ ID NO: 6) and includes polypeptides having 
25 greater than about 70%, 75%, 85% or 95% amino acid 

identity with Arabidopsis AGLl (SEQ ID NO: 6). Given the 
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close relatedness of the AGLl and AGL5 gene products, one 
skilled in the art will recognize that an AGLl ortholog 
can be distinguished from an AGL5 ortholog by being more 
closely related to Arabidopsis AGLl (SEQ ID NO: 6) than to 
5 Arabidopsis AGL5 (SEQ ID NO: 8). An AGLl ortholog can 
function in wild type plants, like Arabidopsis AGLl, to 
limit the domain of AGL8-like gene product expression to 
the carpel valves during the later stages of floral 
development . 

10 As used herein, the term '^AGLS" refers to 

Arabidopsis AGL5 (SEQ ID NO: 8) or to an ortholog of 
Arabidopsis AGL5 (SEQ ID N0:8). An AGL5 ortholog is a 
MADS box gene product expressed, at least in part, in the 
valve margins of a seed plant and having homology to the 

15 amino acid sequence of Arabidopsis AGL5 • (SEQ ID N0:8), 
AGLS or . an AGL5 ortholog can function, in part, by 
forming a complex with an AGLB-like gene product a.s shown 
in Example IV. An AGL5 ortholog generally has an amino 
acid sequence having at least about 60% amino acid 

20 identity with Arabidopsis AGLS (SEQ ID NO: 8) and includes 
polypeptides having greater than about 65%, 70%, 75%, 85% 
or 95% amino acid identity with Arabidopsis AGLS (SEQ ID 
N0:8). Given the close relatedness of the AGLl and AGLS 
gene products, one skilled in the art will recognize that 

25 an AGLS ortholog can be distinguished from an AGLl 

ortholog by being more closely related to Arabidopsis 
AGLS (SEQ ID NO: 8) than to Arabidopsis AGLl (SEQ ID 
NO: 6). An AGLS ortholog can function in wild type 
plants, like Arabidopsis AGLS, to limit the domain of 

30 AGL8-like gene product expression to the carpel valves 
during the later stages of floral development. 

The term "suppressed, " as used herein in 
reference to AGLl expression, means that the amount of 
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functional AGLl protein is reduced in a seed plant in 
comparison with the amount of functional AGLl protein in 
the corresponding wild type seed plant. Similarly, when 
used in reference to AGL5 expression, the term suppressed 
5 means that the amount of functional AGL5 protein is 

reduced in a seed plant in comparison with the amount of 
functional AGL5 protein in the corresponding wild type 
seed plant. Thus, the term "suppressed," as used herein, 
encompasses the absence of AGLl or AGL5 protein in a seed 

10 plant, as well as protein expression that is present but 
reduced as compared to the level of AGLl or AGL5 protein 
expression in a wild type seed plant. Furthermore, the 
term suppressed refers to AGLl or AGL5 protein expression 
that is reduced throughout the entire domain of AGLl or 

15 AGL5 expression, or to expression that is reduced in some 
part of the AGLl or AGL5 expression domain, provided that 
the resulting seed plant is characterized by delayed seed 
dispersal . 

As used herein, the term "suppressed" also 
20 encompasses an amount of AGLl or AGL5 protein that is 
equivalent to wild type AGLl or AGL5 expression, but 
where the AGLl or AGL5 protein has a reduced level of 
activity. As discussed above, AGLl and AGL5 each contain 
a conserved MADS domain; point mutations or gross 
25 deletions within the MADS domain that reduce the 

DNA-binding activity of AGLl or AGL5 can reduce or 
destroy the activity of AGLl or AGL5 and, therefore, 
"suppress" AGLl or AGL5 expression as defined herein. 
One skilled in the art will recognize that, preferably, 
30 AGLl expression is - essentially absent in the valve margin 
of a seed plant or the AGLl protein is essentially 
non-functional and, similarly, that, preferably, AGL5 
expression is essentially absent in the valve margin of 
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the seed plant or the AGL5 protein is essentially 
non-f unct ional . 

A variety of methodologies can be used to 
suppress AGLl or AGL5 expression in a seed plant. 
5 Suppression can be achieved by directly modifying the 

ACL2 or ACL5 genomic locus, for example, by modifying an 
AGLl or AGL5 regulatory sequence such that transcription 
or translation from the AGLl or AGL5 locus is reduced, or 
by modifying an AGLl or AGL5 coding sequence such that 

10 non- functional AGLl or AGL5 protein is produced. 

Suppression of AGLl or AGL5 expression in a seed plant 
also can be achieved indirectly, for example, by 
modifying the expression or activity of a protein that 
regulates AGLl or AGL5 expression. Methodologies for 

15 effecting suppression of AGLl or AGL5 expression in a 
seed plant include, for example, homologous 
recombination, chemical and t ransposon-mediated 
mutagenesis, cosuppress ion and antisense-based techniques 
and dominant negative methodologies. 

20 Homologous recombination of AGLl or AGL5 can be 

used to suppress AGLl or AGL5 expression in a seed plant 
as described in Kempin et al . , Nature 389:802-803 (1997), 
which is incorporated herein by reference. Homologous 
recombination can be used, for example, to replace the 

25 wild type AGL5 genomic sequence with a construct in which 
the gene for kanamycin resistance is flanked by at least 
about 1 kb of AGL5 sequence. The use of homologous 
recombination to suppress AGL5 expression is set forth in 
Example I I . 

30 Suppression of AGLl or AGL5 expression also can 

be achieved by producing a loss-of -function mutation 
using transposon-media ted insertional mutagenesis with Ds 
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transposons or Stm transposons (see, for example, 
Sundaresan et al . , Genes Devel. 9:1797-1810 (1995), which 
is incorporated herein by reference). Insertion of a 
transposon into an AGLl or AGL5 target gene can be 
5 identified, for. example, by restriction mapping, which 
can identify the presence of an insertion in the gene 
promoter or in the coding region, such that expression of 
functional gene product is suppressed. Insertion of a 
transposon also can be identified by detecting an absence 

10 of the mRNA encoded by the target gene or by the 

detecting the absence of the gene product in valve 
margin. Suppression of AGLl or AGL5 expression also can 
be achieved by producing a loss-of -function mutation 
using T-DNA-mediated insertional mutagenesis (see Krysan 

15 et al., Proc. Natl. Acad. Sci . , USA 93:8145-8150 (1996)). 
The use of T-DNA-mediated insertional mutagenesis to 
suppress AGLl expression is disclosed in Example II. 

Suppression of AGLl or AGL5 expression in a 
seed plant also can be achieved using cosuppression, 

20 which is a well known methodology that relies on 

expression of a nucleic acid molecule in the sense 
orientation to produce coordinate silencing of the 
introduced nucleic acid molecule and the homologous 
endogenous gene (see, for example, Flavell, Proc. Natl. 

25 Acad. Sci . , USA 91:3490-3496 (1994); Kooter and Mol, 

Current Opin. Biol. 4:166-171 (1993), each of which is 
incorporated herein by reference) . Cosuppression is 
induced most strongly by a large number of transgene 
copies or by overexpression of transgene RNA and can be 

30 enhanced by modification of the transgene such that it 
fails to be translated, 

Antisense nucleic acid molecules encoding AGLl 
and AGL5 gene products, or fragments thereof, also can be 
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used to suppress expression of AGLl and AGL5 in a seed 
plant. Antisense nucleic acid molecules reduce mRNA 
translation or increase mRNA degradation, thereby 
suppressing gene expression {see, for example, Kooter and 
5 Mol, supra, 1993; Pnueli et al.. The Plant Cell Vol. 6, 
175-186 (1994), which is incorporated herein by 
reference) . 

To produce a non-naturally occurring seed 
plant of the invention, in which AGLl and AGL5 expression 

10 each are suppressed, the one or more sense or antisense 
nucleic acid molecules can be expressed under control of 
a strong regulatory element that is expressed, at least 
in part, in the valve margin of the seed plant. The 
constitutive CaMV 35S promoter (Odell et ^1 . , 

15 supra, 1985), for example, or other constitutive 

promoters as disclosed herein, can be useful in the 
methods of the invention. Dehiscence zone-selective 
regulatory elements also can be useful for expressing one 
or more sense or antisense nucleic acid molecules in 

20 order to suppress AGLl and AGL5 expression in a seed 
plant 

The skilled artisan will recognize that 
effective suppression of endogenous AGLl and AGL5 gene 
expression depends upon the one or more introduced 

25 nucleic acid molecules having a high percentage of 

homology with the corresponding endogenous gene loci. 
Nucleic acid molecules encoding Arabidopsis AGLl (SEQ ID 
NO: 5) and AGL5 (SEQ ID NO: 7) are provided herein (see, 
also. Ma et al., supra, 1991). Nucleic acid molecules 

30 encoding Arabidopsis AGLl and AGL5 can be useful in the • 
methods of the invention or for isolating orthologous 
AGLl and AGL5 sequences. 
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The homology requirement for effective 
suppression using homologous recombination, cosuppression 
or antisense methodology can be determined empirically. 
In general, a minimum of about 80-90% nucleic acid 
5 sequence identity is preferred for effective suppression 
of AGLl or AGL5 expression. Thus, a nucleic acid 
molecule encoding a gene ortholog from the family or 
genus of the seed plant species into which the nucleic 
acid molecule is to be introduced is preferred for 

10 generating the non-naturally occurring seed plants of the 
invention using homologous recombination, cosuppression 
or antisense technology. More preferably, a nucleic acid, 
molecule encoding a gene ortholog from the same seed 
plant species is used for suppressing AGLl expression and 

15 AGL5 expression in a seed plant of the invention. For 

example, nucleic acid molecules encoding canola AGLl and 
AGL5 are preferable for suppressing AGLl and AGL5 
expression in a canola plant. 

Although use of a highly homologous nucleic 
20 acid molecule is preferred in the methods of the 

invention, the nucleic acid molecule to be used for 
homologous recombination, cosuppression or antisense 
suppression need not contain in its entirety the AGLl or 
AGL5 sequence to be suppressed. Thus, a sense or 
25 antisense nucleic acid molecule encoding only a portion 
of Arabidopsls AGLl (SEQ ID NO: 5), for example, or a 
sense or antisense nucleic acid molecule encoding only a 
portion of Arabidopsls AGL5 (SEQ ID NO: 7) can be useful 
for producing a non-naturally occurring seed plant of the 
30 invention, in which AGLl and AGL5 expression each are 
suppressed . 

A portion of a nucleic acid molecule to be 
homologously recombined with an AGLl or AGL5 locus 
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generally contains at least about 1 kb of sequence 
homologous to the targeted gene and preferably contains 
at least about 2 kb, more preferably at least about 3 kb 
and can contain at least about 5 kb of sequence 
5 homologous to the targeted gene. A portion of a nucleic 
acid molecule encoding an AGLl or AGL5 to be used for 
cosuppression or antisense suppression generally contains 
at least about 50 base pairs to the full-length of the 
nucleic acid molecule encoding the AGLl or AGL5 ortholog. 
10 In contrast to an active segment, as defined herein, a 
portion of a nucleic acid molecule to be used for 
homologous recombination, cosuppression or antisense 
suppression need not encode a functional part of a gene 
product . 

15 A dominant negative construct also can be used 

to suppress AGLl or AGL5 expression in a seed plant. A 
dominant negative construct useful in the invention 
generally contains a portion of the "complete AGLl or AGL5 
coding sequence sufficient, for example, for DNA-binding 

20 or for a protein-protein interaction such as a 

homodimeric or heterodimeric protein-protein interaction 
but lacking the transcriptional activity of the wild type 
protein. For example, a carboxy-terminal deletion mutant 
of AGAMOUS was used as a dominant negative construct to 

25 suppress expression of the MADS box gene AGAMOUS 

(Mizukami et al.. Plant Cell 8:831-844 (1996), which is 
incorporated by reference herein) . One skilled in the 
art understands that, similarly, a dominant negative AGLl 
or AGL5 construct can be used to suppress AGLl or AGL5 

30 expression in a seed plant. A useful dominant negative 
construct can be a deletion mutant encoding, for example, 
the MADS box domain alone ("M"), the MADS box domain and 
"intervening" region ("MI"); the MADS box, "intervening" 
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and "K" domains ("MIK"); or the "intervening," "K" and 
carboxy-terminal domains ("IKC"). 

In a preferred embodiment, a non-naturally 
occurring seed plant of the invention is an agll agl5 
5 double mutant. An agll agl5 double mutant is a 

particularly useful non-naturally occurring seed plant 
that is characterized by delayed seed dispersal. 

As used herein, the term "agll agl5 double 
mutant" means a seed plant having a loss-of-f unction 
mutation at the AGLl locus and a loss-of-f unction 
mutation at the AGL5 locus. Loss-of-f unction mutations 
encompass point mutations, including substitutions, 
deletions and insertions, as well as gross modifications 
of an AGLl and AGL5 locus and can be located in coding or 
non-coding sequences. One skilled in the art understands 
that any such loss-of-f unction mutation at the AGLl locus 
can be combined with any such mutation at the AGL5 locus 
to generate an agll agl5 double mutant of the invention. 
Production of an exemplary agll agl5 double mutant in the 
Brassica seed plant Arabidopsis is disclosed herein in 
Example II . 

AGLl and AGL5 are closely related genes that 
have diverged relatively recently. While not wishing to 
be bound by the following, some plants can contain only 
25 AGLl or only AGL5 ^ or can contain a single ancestral gene 
related to AGLl and AGL5 . In such plants, a seed plant 
characterized by delayed seed dispersal can be produced 
by suppressing only expression of AGLl, or expression of 
AGL5, or expression of a single ancestral gene related to 
30 AGLl and AGL5 . Thus, the present invention provides a 
non-naturally occurring seed plant characterized by 
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delayed seed dispersal, in which AGLl expression is 
suppressed. Such a non-naturally occurring seed plant 
characterized by delayed seed dispersal can be, for 
example, an agll single mutant. The present invention 
5 also provides a non-naturally occurring seed plant 

characterized by delayed seed dispersal, in which Ai3L5 
expression is suppressed. A non-naturally occurring seed 
plant characterized by delayed seed dispersal in which 
AGL5 expression is suppressed can be, for example, an 
10 agl5 single mutant. 

The present invention further provides tissues 
derived from non-naturally occurring seed plants of the 
invention. In one embodiment, the invention provides a 
tissue derived from a non-naturally occurring seed plant 

15 that has an ectopically expressed nucleic acid molecule 
encoding an AGL8-like gene product and is characterized 
by delayed seed dispersal. In another embodiment, the 
invention provides a tissue derived from a non-naturally 
occurring seed plant in which AGLl expression and AGL5 

20 expression each are suppressed, where the seed plant is 
characterized by delayed seed dispersal. 

As used herein, the term ""tissue" means an 
aggregate of seed plant cells and intercellular material 
organized into a structural and functional unit. A 
25 particular useful tissue of the invention is a tissue 

that can be vegetatively or non-vegetatively propagated 
such that the seed plant from which the tissue was 
derived is reproduced. A tissue of the invention can be, 
for example, a seed, leaf, root or part thereof* 

30 As used herein, the term "seed" means a 

structure formed by the maturation of the ovule of a seed 
plant following fertilization- Such seeds can be readily 
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harvested from a non-naturally occurring seed plant of 
the invention characterized by delayed seed dispersal, 

A seed plant characterized by enhanced seed 
dispersal also can be produced by manipulating expression 
5 of an AGL8-like gene product or AGLl or AGL5. 

Suppression of AGL8-like gene product expression in a 
seed plant, for example, suppression of AGL8-like gene 
product expression in valve tissue, can be used to 
produce a seed plant characterized by enhanced seed 

10 dispersal. Ectopic expression of AGLl or AGL5, or both, 
in a seed plant, for example, premature expression of 
AGLl or AGL5, also can be used to produce a non-naturally 
occurring seed plant of the invention characterized by 
enhanced seed dispersal. The skilled person understands 

15 that these or other strategies of manipulating AGL8, AGLl 
or AGL5 expression can be used to produce a non-naturally 
occurring seed plant characterized by enhanced seed 
dispersal . 

The invention also provides a substantially 
20 purified dehiscence zone-selective regulatory element, 
which includes a nucleotide sequence that confers 
selective expression upon an operatively linked nucleic 
acid molecule in the valve margin or dehiscence zone of a 
seed plant, provided that the dehiscence zone-selective 
25 regulatory element does not have a nucleotide sequence 
consisting of nucleotides 1889 to 2703 of SEQ ID NO: 4. 

As used herein, the term "dehiscence 
zone-selective regulatory element" refers to a nucleotide 
sequence that, when operatively linked to a nucleic acid 
30 molecule, confers selective expression upon the 

operatively linked nucleic acid molecule in a limited 
number of plant tissues, including the valve margin or 
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dehiscence zone. As discussed above, the valve margin is 
the future site of the dehiscence zone and encompasses 
the margins of the outer replum as well as valve cells 
adjacent to the outer replum. The dehiscence zone, which" 
5 develops in the region of the valve margin, refers to the 
group of cells that separate during the process of 
dehiscence, allowing valves to come apart from the replum 
and the enclosed seeds to be released. Thus, a 
dehiscence zone-selective regulatory element, as defined 
10 herein, confers selective expression in the mature 

dehiscence zone, or confers selective expression in the 
valve margin, which marks the future site of. the 
dehiscence zone. 



A dehiscence zone-selective regulatory element 

15 can confer specific expression exclusively in cells of 
the valve margin or dehiscence zone or can confer 
selective expression in a limited number of plant cell 
types including cells of the valve margin or dehiscence 
zone. An AGL5 regulatory element, for example, which 

20 confers selective expression in ovules and placenta as 
well as in the dehiscence zone, is a dehiscence 
zone-selective regulatory element as defined herein. A 
dehiscence zone-selective regulatory element generally is 
distinguished from other regulatory elements by 

25 conferring selective expression in the valve margin or 

dehiscence zone without conferring expression throughout 
the adjacent carpel valves. 

The Arabidopsis AGLl gene (SEQ ID NO: 3) is 
shown in Figure 7, with the intron-exon boundaries 

30 indicated. The Arabidopsis AGL5 gene (SEQ ID NO: 4) is 
shown in Figure 8, with the intron-exon boundaries 
indicated. An AGLl or AGL5 regulatory element, such as a 
5* regulatory element or intronic regulatory element, can 
confer selective expression in the valve margin or 
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dehiscence zone and, thus, is a dehiscence-zone selective 
regulatory element as defined herein. The AGL5 gene, for 
example, is selectively expressed in the dehiscence zone, 
placenta and ovules, and an AGL5 regulatory element can 
5 confer selective expression in the dehiscence zone, 

placenta and ovules upon an operatively linked nucleic 
acid molecule. 



The invention provides a dehiscence 
10 zone-selective regulatory element that is an AGLl or AGL5 
regulatory element. Such a dehiscence zone-selective 
regulatory element can be, for example, an AGLl 
regulatory element. An AGLl regulatory element can have, 
for example, the nucleotide sequence of a non-coding 
15 portion of the Arabidopsis AGLl genomic sequence 

identified as SEQ ID NO: 3. A dehiscence zone-selective 
regulatory element also can be, for example, an AGL5 
regulatory element. An AGL5 regulatory element can have, 
for example, the nucleotide sequence of a non-coding 
20 portion of the Arabidopsis AGL5 genomic sequence 

identified as SEQ ID NO: 4, provided that the regulatory 
element does not have a nucleotide sequence consisting of 
nucleotides 1889 to 2703 of SEQ ID N0:4. 

As used herein, the term "substantially the 
25 nucleotide sequence, " when used in reference to an AGLl 
or AGL5 regulatory element, means a nucleotide sequence 
having an identical sequence, or a nucleotide sequence 
having a similar, non-identical sequence that is 
considered to be a functionally equivalent sequence by 
30 those skilled in the art. For example, a dehiscence 
zone-selective regulatory element that is an AGLl 
regulatory element can have, for example, a nucleotide 
sequence identical to the sequence of the Arabidopsis 
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AGLl regulatory element having nucleotides 1 to 2599 of 
SEQ ID NO: 3 shown in Figure 1, or a similar, 
non-identical sequence that is functionally equivalent.' 
A dehiscence zone-selective regulatory element can have, 
5 for example, one or more modifications such as nucleotide 
additions, deletions or substitutions relative to the 
nucleotide sequence shown in Figure 8, provided that the 
modified nucleotide sequence retains substantially the 
ability to confer selective expression in the valve 
10 margin or dehiscence zone upon an operatively linked 
nucleic acid molecule. 

It is understood that limited modifications can 
be made without destroying the biological function of an 
AGLl or AGL5 regulatory element and that such limited 

15 modifications can result in dehiscence zone-selective 

regulatory elements that have substantially equivalent or 
enhanced function as compared to a wild type AGLl or AGL5 
regulatory element. These modifications can be 
deliberate, as through site-directed mutagenesis, or can 

20 be accidental such as through mutation in hosts harboring 
the regulatory element. All such modified nucleotide 
sequences are included in the definition of a dehiscence 
zone-selective regulatory element as long as the ability 
to confer selective expression in the valve margin or 

25 dehiscence zone is substantially retained. 

A dehiscence zone-selective regulatory element 
can be derived from a gene that is an ortholog of 
Arabidopsis AGLl or AGL5 and is selectively expressed in 
the valve margin or dehiscence zone of a seed plant. A 
30 dehiscence zone-selective regulatory element can be 

derived, for example, from an AGLl or AGL5 ortholog of 
the Brasslcaceae , such as a Brassica napus ^ Brassica 
oleracBa , Brassica campestris, Brassica jvncea, Brassica 
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nigra or Brasslca carinata AGLl or AGL5 ortholog. A 
dehiscence zone-selective regulatory element can be 
derived, for example, from an AGLl or AGL5 canola 
ortholog. A dehiscence zone-selective regulatory element 
5 also can be derived, for example, from a leguminous AGLl 
or AGL5 ortholog, such as a soybean, pea, chickpea, moth 
bean, broad bean, kidney bean, lima bean, lentil, cowpea, 
dry bean, peanut, alfalfa, lucerne, birdsfoot trefoil, 
clover, stylosanthes ^ lotononis bainessil, or sainfoin 
10 AGLl or AGL5 ortholog. 



Dehiscence zone-selective regulatory elements 
also can be derived from a variety of other genes that 
are selectively expressed in the valve margin or 
dehiscence zone of a seed plant. For example, the 
15 rapeseed gene RDPGl is selectively expressed in the 
dehiscence zone (Petersen et al.. Plant Mol . 
Ri ol . 31:517-527 (1996), which is incorporated herein by 
reference). Thus, the RDPGl promoter or an active 
fragment thereof can be a dehiscence zone-selective 
20 regulatory element as defined herein. Additional genes 
such as the rapeseed gene SAC51 also are known to be 
' selectively expressed in the dehiscence zone; the SAC51 
promoter or an active fragment thereof also can be a 
dehiscence zone-selective regulatory element of the 
25 invention (Coupe et al.. Plant Mol . Biol . 23:1223-1232 
(1993), which is incorporated herein by reference). 
Further, genes selectively expressed in the dehiscence 
zone include the gene that confers selective GUS 
expression in the Arabldopsis transposant line GT140 
30 (Sundaresan et al.. Genes Devel . 9:1797-1810 (1995), 

which is incorporated herein by reference) . The skilled 
artisan understands that a regulatory element of any such 
gene selectively expressed in cells of the valve margin 
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or dehiscence zone can be a dehiscence zone-selective 
regulatory element as defined herein. 

Additional dehiscence zone-selective regulatory 
elements can be identified and isolated using routine 
methodology. Differential screening strategies using, 
for example, RNA prepared from the dehiscence zone and 
RNA prepared from adjacent pod material can be used to 
isolate cDNAs selectively expressed in cells of the 
dehiscence zone (Coupe et al., supra ^ 1993); 
subsequently, the corresponding genes are isolated using 
the cDNA sequence as a probe. 

Enhancer trap or gene trap strategies also can 
be used to identify and isolate a dehiscence 
zone-selective regulatory element of the invention 
15 (Sundaresan et al., supra ^ 1995; Koncz et al., Proc> 

Natl. Acad. Sci . USA 86:8467-8471 (1989); Kertbundit et 
al-, Proc. Nat]. Acad. Sci. USA 88:5212-5216 (1991); 
Topping et al.. Development 112:1009-1019 (1991), each of 
which "is incorporated herein by reference) . Enhancer 
20 trap elements include a reporter gene such as GUS with a 
weak or minimal promoter, while gene trap elements lack a 
promoter sequence, relying on transcription from a 
flanking chromosomal gene for reporter gene expression. 
Transposable elements included in the constructs mediate 
25 fusions to endogenous loci; constructs selectively 

expressed in the valve margin or dehiscence zone are 
identified by their pattern of expression. With the 
inserted element as a tag, the flanking dehiscence 
zone-selective regulatory element is cloned using, for 
30 example, inverse polymerase chain reaction methodology 
(see, for example, Aarts et al.. Nature 363:715-717 
(1993); see, also, Ochman et al., "Amplification of 
Flanking Sequences by Inverse PGR," in Innis et al.. 
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supra^ 1990) . The Ac/Ds transposition system of 
Sundaresan et al., supra, 1995, can be particularly 
useful in identifying and isolating a dehiscence 
zone-selective regulatory element of the invention. 

5 Dehiscence zone-selective regulatory elements 

also can be isolated by inserting a library of random 
genomic DNA fragments in front of a promoterless reporter 
gene and screening transgenic seed plants transformed 
with the library for dehiscence zone-selective reporter 
10 gene expression. The promoterless vector pROA97, which 
contains the npt gene and the GUS gene each under the 
control of the minimal 35S promoter, can be useful for 
such screening. The genomic library can be, for example, 
Sau3A fragments of Arahidopsis thaliana genomic DNA or 
15 genomic DNA from, for example, another Brasslcacea^ of 
interest (Ott et al,, MoJ . Gen . Genpt . 223:169-179 
(1990); Claes et al.. The Plant Journal 1:15-26 (1991), 
each of which is incorporated herein by reference) 

Dehiscence zone-selective expression of a 
regulatory element of the invention can be demonstrated 
or confirmed by routine techniques, for example, using a 
reporter gene and in situ expression analysis. The GUS 
and firefly luciferase reporters are particularly useful 
for in situ localization of plant gene expression 
(Jefferson et al . , EMBO J. 6:3901 (1987); Ow et al.. 
Science 334:856 (1986), each of which is incorporated 
herein by reference) , and promoterless vectors containing 
the GUS expression cassette are commercially available, 
for example, from Clontech (Palo Alto, CA) . To identify 
a dehiscence zone-selective regulatory element of 
interest such as an AGLl or AGL5 regulatory element, one 
or more nucleotide portions of the AGLl or AGL5 gene can 
be generated using enzymatic or PCR-based methodology 
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(Click and Thompson, supra, 1993; Innis et al., supra, 

1990) ; the resulting segments are fused to a reporter 

gene such as GUS and analyzed as described above. 

The present invention also provides a 
5 substantially purified dehiscence zone-selective 

regulatory element that confers selective expression upon 
an operatively linked nucleic acid molecule in the valve 
margin or dehiscence zone of a seed plant, where the 
element is an AGLl regulatory element having at least 

10 fifteen contiguous nucleotides of one of the following 
nucleotide sequences: nucleotides 1 to 2599 of SEQ ID 
N0:3; nucleotides 2833 to 4128 of SEQ ID NO:3; 
nucleotides 4211 to 4363 of SEQ ID N0:3; nucleotides 442G 
to 4554 of SEQ ID NO:3; nucleotides 4655. to 4753; 

15 nucleotides 4796 to 4878 of SEQ ID N0:3; nucleotides 4921 
to 5028 of SEQ ID NO: 3; or nucleotides 5361 to 5622 of 
SEQ ID NO: 3- A substantially purified dehiscence 
zone-selective regulatory element that is an AGLl 
regulatory element can have, for example, at least 16, 

20 18, 20, 25, 30, 40, 50, 100 or 500 contiguous nucleotides 
of one of the portions of SEQ ID NO: 3 described above. 

The present invention also provides a 
substantially purified dehiscence zone-selective 
regulatory element that confers selective expression upon 

25 an operatively linked nucleic acid molecule in the valve 
margin or dehiscence zone of a seed plant, where the 
element is an AGL5 regulatory element having at least 
fifteen contiguous nucleotides of one of the following 
nucleotide sequences: nucleotides 1 to 1888 of SEQ ID 

30 N0:4; nucleotides 2928 to 5002 of SEQ ID N0:4; 

nucleotides 5085 to 5204 of SEQ ID NO:4; nucleotides 5367 
to 5453 of SEQ ID NO: 4; nucleotides 5496 to 5602; 
nucleotides 5645 to 5734 of SEQ ID NO: 4; or nucleotides 
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6062 to 6138 of SEQ ID NO: 4. A substantially purified 
dehiscence zone-selective regulatory element that is an 
ACL5 regulatory element can have, for example, at least 
16, 18, 20, 25, 30, ^0, 50, 100 or 500 contiguous 
5 nucleotides of one of the portions of SEQ ID NO: 4 
described above. 



A proximal fragment of the Arabidopsis AGL5 
promoter has been described {Savidge et al,. The Plant 
Cel 1 7:721-733 (1995)). However, this fragment (shown as 

10 nucleotides 1889 to 2703 in Figure 8) lacks many of the 
distal regulatory elements contained in the entire 
Arabidopsis AGL5 genomic sequence disclosed herein (SEQ 
ID NO: 4). The present invention provides approximately 
2.7 kb of Arabidopsis AGL5 5' flanking sequence, 

15 including the variety of regulatory elements contained 
therein. The disclosed A^rabidopsis AGL5 5' flanking 
sequence contains a larger complement of regulatory 
elements involved in regulating expression of the 
endogenous AGL5 gene in vivo and, therefore, can be 

20 particularly useful for dehiscence zone-selective 
expression. 



A nucleotide sequence consisting of the 
promoter proximal region of Arabidopsis AGL5 (nucleotides 
1889 to 2703 of SEQ ID NO:4) is explicitly excluded from 

25 a dehiscence zone-selective regulatory element of the 
invention. However, a dehiscence zone-selective 
regulatory element can include nucleotides 1889 to 2703 
of SEQ ID NO: 4, together with one or more contiguous 
nucleotides, for example, of the nucleotide sequence 

30 shown as positions 1 to 1888 of SEQ ID N0:4. A 

dehiscence zone-selective regulatory element of the 
invention can have, for example, at least 15 contiguous 
nucleotides of SEQ ID NO: 4, including at least one, two. 
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four, six, ten, twenty or thirty or more contiguous 
nucleotides of the nucleotide sequence shown as positions 
1 to 1888 of SEQ ID NO:4.- 

view of the definition of a dehiscence 
zone-selective regulatory element, it should be 
recognized, for example, that a portion of the 
Arabidopsis AGL5 gene having only the sequence shown as 
nucleotides 1889 to 2703 in Figure 8 (SEQ ID N0:4), is 

10 not a dehiscence zone-selective regulatory element as 
defined herein. However, a portion of an Arabidopsis 
ACL5 gene having nucleotides 1885 to 2703 of SEQ ID N0:4 
is considered a dehiscence zone-selective regulatory 
element, provided that the element confers selective 

15 expression upon an operatively linked nucleic acid 

molecule in a limited number of plant tissues, including 
the valve margin or dehiscence zone. Similarly, a 
portion of an Arabidopsis AGL5 gene having a subpart of 
the promoter proximal region of AGL5 also can be a 

20 dehiscence zone-selective regulatory element as defined 
herein, provided that this subpart can confer selective 
expression upon an operatively linked nucleic acid 
molecule in a limited number of plant tissues, including 
the valve margin or dehiscence zone of a seed plant, 

25 Thus, for example, a regulatory element having the 

sequence of nucleotides 1889 to 2000 can be a dehiscence 
zone-selective regulatory element of the invention, 
provided that this element confers selective expression 
upon an operatively linked element in the valve margin or 

30 dehiscence zone of a seed plant. 

The present invention also provides a 
recombinant nucleic acid molecule that includes a 
dehiscence zone-selective regulatory element operatively 
linked to a nucleic acid molecule encoding a cytotoxic 
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gene product. Further provided herein is a non-naturally 
occurring seed plant of the invention that is 
characterized by delayed seed dispersal due to expression 
of a recombinant nucleic acid molecule having a 
5 dehi scence zone-select ive regulatory element operat ively 
linked to a nucleic acid molecule encoding a cytotoxic 
gene product . 

A cytotoxic gene product is a gene product that 
causes the death of the cell in which it is expressed 

10 and, preferably, does not result in the death of cells 
other than the eel 1 in which it is expressed. Thus, 
expression of a cytotoxic gene product from a dehiscence 
zone-selective regulatory element can be used to ablate 
the dehiscence zone without disturbing neighboring cells 

15 of the replum or valve . A variety of cytotoxic gene 
products useful in seed plants are known in the art 
including, for example, diphtheria toxin A chain 
polypeptides ; RNase Tl ; Barnase RNase ; ricin toxin A 
chain polypeptides ; and herpes simplex virus thymidine 

20 kinase (tk) gene products. While the diphtheria toxin A 
chain, RNase Tl and Barnase RNase are preferred cytotoxic 
gene products, the skilled person recognizes that these, 
or other cytotoxic gene products can be used with a 
dehiscence zone- select ive regulatory element to generate 

25 a non-naturally occurring seed plant characterized by . 
delayed seed dispersal . 

Diphtheria toxin is the naturally occurring 
toxin of CorJiejbacteriujH diphtheriae, which catalyzes the 
ADP-ribosylat ion of elongation factor 2, resulting in 
3 0 inhibition of protein synthesis and consequent cell death 
(Collier, Bacteriol . Rev . 39:54-85 (1915)), A single 
molecule of the fully active toxin is sufficient to kill 
a cell (Yamaizumi et al., Cell 15:245-250 (1978)). 
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understands that the toxicity of ricin depends is 
variable and should be assessed for toxicity in the seed 
plant species of interest (see Olsnes and Pihl, Molecular 
Action of Toxins and Viruses , pages 51-105, Amsterdam: 
5 Elsevier Biomedical Press (1982)). 



Further provided herein is a plant expression 
vector including a dehiscence zone-selective regulatory 
element. A plant expression vector can include, if 
desired, a nucleic acid molecule encoding an AGL8-like 
10 gene product in addition to the dehiscence zone-selective 
regulatory element . 



The term "plant expression vector, " as used 
herein, is a sel f - repl ica t ing nucleic acid molecule that 
provides a means to transfer an exogenous nucleic acid 
15 molecule into a seed plant host cell and to express the 
molecule therein. Plant expression vectors encompass 
vectors suitable for /igrobacteri u/rj-mediated 
transformation, including binary and cointegra t ing 
vectors, as well as vectors for physical transformation. 



20 Plant expression vectors can be used for 

transient expression of the exogenous nucleic acid 
molecule, or can integrate and stably express the 
exogenous sequence. One skilled in the art understands 
that a plant expression vector can contain all the 

25 functions needed for transfer and expression of an 

exogenous nucleic acid molecule; alternatively, one or 
more functions can be supplied in trans as in a binary 
vector system for ^grojbact eriu/n-mediated transformation. 



In addition to a dehiscence zone-selective 
30 regulatory element, a plant expression vector of the 

invention can contain, if desired, additional elements. 
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A binary vector for Agrobacteriu/n-mediated transformation 
contains one or both T-DNA border repeats and can also 
contain, for example, one or more of the following: a 
broad host range replicon, an ori T for efficient 
5 transfer from E. coli to Agrobacterlum ^ a bacterial 
selectable marker such as ampicillin and a polylinker 
containing multiple cloning sites. 



A plant expression vector for physical 
transformation can have, if desired, a plant selectable 

10 marker in addition to a dehiscence zone-selective 

regulatory element in vectors such as pBR322, pUC, pGEM 
and K13, which are commercially available, for example, 
from Pharmacia (Piscataway, NJ) or Promega (Madison, WI) . 
In plant expression vectors for physical transformation 

15 of a seed plant, the T-DNA borders or the ori T region 
can optionally be included but provide no advantage. 



The present invention also provides a kit for 
producing a transgenic seed plant characterized by 
delayed seed dispersal. A kit of the . invent ion contains 
20 a dehiscence zone-selective regulatory element. If 

desired, the dehiscence zone-selective regulatory element 
can be operatively linked to a nucleic acid molecule 
encoding an AGL8-like gene product. 

The following examples are intended to 
25 illustrate but not limit the present invention. 
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EXAMPLE I 

PRODUCTION OF A 3 5S-AGT.R TRANSGENIC ARABTDOPSIS PLANT 
DISPLAYIN G A COMPLETE T.ACK OF DEHISCENCE 

This example describes methods for producing a 
5 transgenic Arabidopsis plant lacking normal dehiscence 
due to constitutive AGL8 expression. 

Full-length AGL8 was prepared by polymerase 
chain reaction amplification using primer AGL8 5-y (SEQ 
ID NO: 9 ; 5 ' - CCGTCGACGATGGGAAGAGGTAGGGTT- 3 * ) and primer 
OAM14 (SEQ ID NO: 10; 5 ' - AATCATTACCAAGATATGAA- 3 M , and 
subsequently cloned into the Sail and BamHI sites of 
expression vector pBIN-JIT, which was modified from 
pBIN19 to include the tandem CaMV 35S promoter, a 
polycloning site and the CaMV polyA signal. Arabldopsls 
was transformed using the ±n plants method of 
Agrobacteriujn-mediated transformation essentially as 
described in Bechtold et al., C.R. Acad. Sci. Paris 
316:1194-1199 (1993), which is incorporated herein by 
reference. Kanamycin- resist ant lines were analyzed for 
the presence of the 35S-AGL8 construct by PGR using a 
primer specific for the 35S promoter and a primer 
specific for the AGLB cDNA, which produced two fragments 
of 850 and 550 bp in the 35S-AGL8 transgenic plants. 
These fragments were absent in plants that had not been 
transformed with the 35S-AGL8 construct. 

The phenotype of approximately 35 35S::AGL8 
lines was analyzed. Of the 35 lines, 7 lines exhibited a 
complete lack of dehiscence. In these lines, the mature 
fruits did not release their seeds unless opened 
30 manually.- Several of the remaining 3 5S: :AGL8 lines 

exhibited delayed dehiscence, whereby seeds were released 
at least a week later than in wild type Arabidopsis 
plants . 
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EXAMPLE II 

PRODUCTION OF AN ARABIDOPSIS aall aal5 double mutant 
DISPLAYING A COMPLETE LACK OF DEHISCENCE 

This example describes the production of an 
5 agll agl5 double mutant displaying a complete lack of 
normal dehiscence. 

A. Production of an aaI5 mutant by homologous 
recombina t ion 

A PCR-based assay of transgenic, plants was used 
10 to identify targeted insertions into AGL5 as described in 
Kempin et al.. Nature 389:802-803 (1997), which is 
incorporated herein by reference. The targeting 
construct consisted of a kanamycin-resist ance cassette 
that was inserted between approximately 3 kb 
15 and 2 kb segments representing the 5' and 3' regions of 
the AGL5 gene, respectively. A successfully targeted 
insertion produces a 1.6 kb deletion within the AGL5 gene 
such that the targeted allele encodes only the first 42 
of 246 amino acid residues, and only 26 of the 56 amino 
20 acids comprising the DNA-binding MADS-domain. The 

recombination event also results in the insertion of the 
2.5 kb kanamycin-resistance cassette within the AGL5 
coding sequence. 

750 kanamycin-resistant transgenic lines were 
25 produced by Agrobacterium-mediated transformation, and 

pools of transf ormants were analyzed using a PCR assay as 
described below to determine if any of these primary 
transf ormants had generated the desired targeted 
insertion into AGL5 , A single line was identified that 
30 appeared to contain the anticipated insertion, and this 
line was allowed to self -pollinate to permit further 
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analyses in subsequent generations. Genomic DNA from the 
homozygous mutant plants was analyzed with more than four 
different restriction enzymes and by several distinct PGR 
amplifications, and all data were consistent with the 
5 desired targeting event. The regions flanking the AGL5 
gene also were analyzed to verify that there were no 
detectable deletions or rearrangements of sequences 
outside of AGL5, 

The kanamycin-resistance cassette within the 
AGL5 targeting construct contains sequences that specify 
transcription termination such that little or no AGL5 RNA 
was expected in the homozygous mutant plants. Using a 
probe specific for the 3* portion of the AGL5 cDNA, AGL5 
transcripts were detected in wild-type but not in Bgl5 
mutant plants. These data indicate that the targeted 
disruption of the AGL5 gene represents a loss-of-f unction 
a 1 lele . 

Characterization of the agl5 line indicated 
that the phenotype of this transgenic was not different 
20 from wild type Arabidopsls . 

The AGL5 knockout (KO) construct was prepared 
in vector pZM104A, which carries the kanamycin-resistance 
cassette flanked by several cloning sites (Miao and Lam, 
Plant J. 7:359-365 (1995), which is incorporated herein 
by reference) . Vector pZM104A also contains the gene 
encoding ^"glucuronidase (GUS) , which allows the 
differentiation of non-homologous from homologous 
integration events-. The 3 kb region representing the 5' 
portion of AGL5 was obtained by PGR amplification using 
primer SEQ ID N0:11 ( 5 ' -CGGATAGCTCGAATATCG-3 ' ) and primer 
SEQ ID N0:12 ( 5 ' -7VACCATTGCGTCGTTTGC- 3 ' ) . The resulting 
fragment was cloned into vector pCRII ( Invitrogen) , and 
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an EcoRI fragment excised and inserted into the EcoRI 
site of pZM104A. The 3* portion of AGL5 was excised as 
an Xbal fragment from an AGL5 genomic clone in the vector 
pCIT30 (Ma et al . , Gene 117:161-167 (1992), which is 
incorporated by reference herein) and inserted into the 
Xbal site of pZM104A. The resulting plasmid, designated 
AGL5 KO, was used in Agrobacteri u/n-mediated infiltration 
of wild-type Arabidopsls plants of the Columbia ecotype. 
The knockout construct was derived from Landsberg erects 
genomic DNA. 

Plants containing a homologous recombination 
event at the AGL5 genomic locus were identified as 
follows. Approximately 750 primary (Tl) 

kanamycin-resistant t ransf ormant s were selected, and DNA 
15 was extracted from individual leaves in pools 

representing ten plants as described in Edwards et al.. 
Nucleic Acids Research 19:1349 (1991), which is 
incorporated by reference herein. To identify a pool 
that contained a candidate targeted disruption, isolated 
20 DNAs were subjected to PGR amplification using primer SEQ 
ID NO: 13 (5' -GTAATTACCAGGCAAGGACTCTCC-3 ' ) r which 
represents AGL5 genomic sequence that is not contained 
within the AGL5 KO construct, and primer SEQ ID NO: 14 

(5 •-GTCATCGGCGGGGGTCATAACGTG-3' ) / which is specific for 
25 the kanamycin-resistance cassette. Amplified 

products were size fractionated on agarose gels, and used 
for standard DNA blotting assays with probe 1. One pool 
of ten plants revealed the anticipated hybridizing band 
of the correct size, and this pool was subsequently 
30 broken down into individual plants. A single 

(Tl) plant was identified that appeared to contain the 

desired event, and this plant was allowed to 

self -pollinate for analyses in subsequent generations. 
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This Tl plant was shown to contain the GUS-reporter gene, 
indicating that in addition to the putative 
homologous integration event, there were independent 
non-homologous events. Segregation in the subsequent 
5 generations allowed the identification of plants that no 
longer contained the GUS-reporter gene, and it was these 
lines that were used for subsequent analyses. 

Plants homozygous for the disruption were 
identified by PGR amplification using primers SEQ ID 

10 NO: 15 (5 ' -GAGGATAGAG7\ACACTACGAATCG-3 * ) and SEQ ID NO: 16 
(5'-CAGGTCAAGTCAATAGATTC-3' ) , which yielded a single 1.5 
kb product in wild type plants, and a single 2.6 kb 
product in the mutant. Further confirmation that these 
plants contained the desired disruption was obtained by 

15 PGR amplification with primers SEQ ID NO: 17 

(5'-GAGAATTTAGTGAATAATATTG-3' ) and SEQ ID NO: 14, which 
gave the expected amplified product in the mutant but no 
product in wild-type plants. 

To confirm that the desired disruption had 
20 occurred, a series of genomic DNA blots representing 

wild-type and homozygous mutant (T4 generation) plants 
were analyzed. Probe 1 hybridized to the expected 3.9 kb 
Xbal fragment in wild-type and mutant plants, whereas the 
1.3 kb Xbal fragment was present only in wild-type. This 
25 same probe hybridized to a 6 kb EcoRI fragment in 
wild-type and to the expected 4.1 and 2.8 kb EcoRI 
fragments in the mutant. Additional digests 
with Bglll and with Hindlll confirmed that the mutant 
plants contained the desired targeted event. To confirm 
30 that there were no detectable deletions or rearrangements 
outside the targeted region, genomic DNA blots of wild 
type and homozygous mutant plants were further analyzed. 
Probe 2 hybridized in wild-type and mutant DNAs to the 
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expected 2.9 kb XmnI fragment, the 1.5 kb and 0.4 kb 
Hindi fragments, and the 0.6 kb Hindlll fragment. Probe 
3 hybridized in wild-type and mutant DNAs to the 9 kb 
Seal fragment, the 3.9 kb Xbal fragment, and the 
5 1 . 8 kb Ndel fragments. The faintly-hybridizing bands in 
the Seal digests represent fragments that span the 
insertion site, and are, as expected, different sizes in 
wild-type and agJ5 mutant plants. 

RNA blotting analyses were performed as 
10 follows. Approximately 6 jug of polyA+ RNA was purified 
using Dynabeads (Dynal) from wild-type and agJ3 mutant 
inflorescences, size fractionated and hybridized using 
standard procedures (Crawford et al., Proc. Natl. Acad. 
5c j . USA 83:8073-8076 (1986), which is incorporated 
15 herein by reference) using a gel-purified AbO bp 
Hindi! I-EcoRI fragment from pCIT2242 (Ma et al., 
supra, 1991) specific for the 3* end of the AGL5 cDNA. 
The same filter was subsequently stripped and 
re-hybridized with a t ubulin-speci f ic probe (Marks et 
20 al.. Plant Mol . Biol . 10:91-104 (1987), which is 

incorporated herein by reference) . Hybridization with 
the tubulin probe verified that approximately equal 
amounts of RNA were present in each lane. 

B. Production of an sail mvtant 

25 A PCR-based screen was used to identify a T-DNA 

insertion into the AGLl gene essentially as described in 
Krysan et al . , supra, 1996. 

RNA blotting analyses demonstrated that AGLl 
RNA was not expressed. The agli mutant displayed 
30 essentially a wild type phenotype. 
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C. Prodgctjon and characterization of an aall aal5 double 
mutant 

agll agl5 double mutants were generated by 
crossing the agll and agl5 single mutants. RNA blotting 
5 experiments of the agll agl5 double mutant are performed 
as described above. The results indicate that neither 
AGLl nor AGL5 RNA is expressed in the agll agl5 double 
mutant . 

In contrast to the agll and agl5 single 
10 mutants, which had essentially the phenotype of wild type 
Arabldopsls, analyses of the agll agl5 double mutant by 
scanning electron microscopy indicated that the 
dehiscence zone failed to develop normally. Furthermore, 
the mature fruits of the agll agl5 double mutant failed 
15 to dehisce. This delayed seed dispersal phenotype was 
similar to AGL8 gain-of-f unction phenotype seen in 
35S-AGL8 transgenic plants. These results indicate that 
the AGLl and AGL5 genes are functionally redundant and 
that their encoded gene products regulate pod dehiscence. 
20 The similarity of the 35S::AGL8 and agii agl5 double 

mutant phenotypes, as well the yeast two-hybrid results 
described below, indicate that AGLl and AGL8 or AGL5 and 
AGL8 can interact to regulate the dehiscence process. 

D, Analysis of dehiscence phenotypes under various 
2 5 condit ions 

Studies of pod dehiscence in Brassica napus L. 
using transmission electron microscopic analyses have 
shown that the middle lamella of the dehiscence zone 
cells degenerates during dehiscence, allowing the valves 
30 to separate from the replum {Petersen et al.. 
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supra, 1996). Similar analyses are performed on the agll 
agl5 double mutant as well as wild type Arabidopsis and 
agll and agl5 single mutants. 

Previous studies have shown that pod dehiscence 
5 is greater when temperatures are high and the relative 
humidity is low. The dehiscence phenotype of the agll 
agl5 double mutant described above was observed for 
plants grown under continuous-light at 25 degrees C. In 
order to determine if the phenotype of agll agl5 double 
10 mutants is sensitive to environmental conditions, the 
analyses described above are repeated under various 
environmental conditions including varying temperature, 
varying humidity and short-day versus continuous light 
conditions. 

15 EXAMPLE III 

PRODUCTION OF A TRANSGENIC ARABIDOPSIS PLANT EXPRESSING 
AGL8 UNDER CONTROL OF THE AGLl PROMOTER 

This example demonstrates that a transgenic 

seed plant expressing AGL8 under control of a dehiscence 

20. zone-selective promoter is characterized by delayed seed 
dispersal . 



AGL1::AGL8 transgenic plants 



Ectopic expression of AGL8 under control of the 
35S promoter prevents pod shatter since the dehiscence 

25 zone fails to differentiate normally. However, 

constitutive AGLB expression conferred by the 35S 
promoter also results in other changes, including early 
flowering. In order to specifically control dehiscence, 
AGLB is expressed from a dehiscence zone -selective 

3 0 regulatory element, such as one derived from a regulated 
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promoter that is normally expressed in valve margin, as 
described below. 



An AGL8 expression construct under control of 
the dehiscence zone-selective 2.5 kb AGLl promoter 

5 fragment and first AGLl intronic sequence is prepared as 

follows. The 2.5 kb AGLl promoter fragment is amplified 

by PGR with primers AGLlpds (SEQ ID N0:18; 
5 ' -GCCAGAGATAATGCTATTCC-3 • ) and AGLlpus (SEQ ID NO:19; 
5 ' -CATTGATGCATATATGACATCAC-3 • ) , and the first coding exon 
10 of AGL8 is amplified with oligos AGLSeds (SEQ ID NO:20; 

5 ' -GTGATGTGATATATGGATCAATGGGAAGAGGTAGGGTTGAG-3 ' ) and 
AGLSeus (SEQ ID NO: 21; 5 ' - CAAGAGTGGGTGGAATATTCG- 3 ' ) . In 
addition, the first intron of AGLl, which can contain 

regulatory elements, is amplified with oligos AGLlids 
15 (SEQ ID NO: 22; 5 ' - CGAATATTCGAGCGACTCTTGGTAGGCTTC 
TCCTACTCTAT-3 ' ) and AGLliup (SEQ ID NO:23; 

5 ' -CTAATAAGTAAGATCGCGGAA-3 ' ) . The remainder of the AGL8 
coding region is amplified with oligos AGL8rds (SEQ ID 
NO : 24 ; 5 * - TTCCGCGATCTTAGTTATTAGGATGGAGAGGATACTTGAAC - 3 ' ) 
20 and OAM14 (SEQ ID NO: 10) . Using PGR with oligos AGLlpds 
(SEQ ID NO:18) and OAM14 (SEQ ID NO:10), the four 
fragments are combined in the following order: AGLl 

promoter, first AGL8 exon, first AGLl intron and 

remainder of AGL8 coding sequence. The resulting 4.6 kb 

25 fragment is cloned into vector pCFM83, which is a vector 
based on pBIN19 that is modified to contain a BASTA 
resistance gene and 3* NOS termination sequence. 

A second AGL8 expression construct, in which 
AGL8 is under control of the dehiscence zone-selective 
3 0 2.5 kb AGLl promoter fragment alone, is prepared as 

follows. The 2.5 kb AGLl promoter fragment is amplified 

by PGR with oligo AGLlpds (SEQ ID NO: 18) and AGLlpus (SEQ 
ID NO:19), and the coding region of AGL8 amplified with 
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oligos AGL8eds (SEQ ID N0:20) and OAM14 (SEQ ID N0:10)- 

Using PGR with oligos AGLlpds (SEQ ID NO: 18) and OAM14 
{SEQ ID NO: 10), the 3.5 kb fragment is cloned into vector 
pCFM83 . 

5 Arabidopsis plants are transformed with the two 

AGL1-AGL8 constructs described above. BASTA resistant 
plants containing the AGLl::AGIi8 transgene with or 
without the AGLl intron are selected. Phenotypic 
analysis indicates that transformed plants containing 
10 either of these constructs are characterized by delayed 
dehiscence. However, the AGL1::AGL8 transgenic plants 
differ from 35S::AGL8 transgenic plants in that an 
enlarged fruit or early flowering phenotype generally is 
not seen. 



15 These results indicate that a transgenic seed 

plant expressing AGLB under control of an AGLl dehiscence 

zone- selective regulatory element is characterized by 
delayed seed dispersal . 

EXAMPLE IV 

2 0 AGL8 INTERACTS WITH AGL5 IN YEAST 

This example demonstrates that, in a yeast 
two-hybrid system, the AGL8 gene product interacts with 
AGL5, 



The "interaction trap" of Finley and Brent 
25 ( Gene Probes: A Practical Approach (1994); see, also 

Gyuris et al.. Cell 75:791-803 (1993)) is a variation of 
the yeast two-hybrid system of Fields and Song, Nature 
340:245-246 (1989). In this system, a first protein is 
fused to a DNA-binding domain, and a second is fused to a 
30 transcriptional activation domain. An interaction 
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between the Arahidopsis AGL5 and AGL8 gene products was 
assayed by activation of a lacZ reporter gene. 

The ''bait" and ""prey" constructs were prepared 
in single copy centromere plasmids pBI-880 and pBI-771, 
5 respectively, which each contain the constitutive ADHl 
promoter and are essentially as described by Chevray and 
Nathans, Proc. Natl. Acad. Sci . USA 89:5789-5793 (1992). 
The bait construct contains the GAL4 DNA-binding domain 
(amino acids 1 to 147) fused to the full-length AGL8 

10 coding sequence. The prey construct has the full-length 
coding sequence of AGL5 fused to the GAL4 transcriptional 
activation domain (amino acids 768-881), following a 
nuclear loca 1 i ::a t ion sequence. The bait and prey 
constructs were assayed in the YPB2 strain of S. 

15 cerevisiae r which is deficient for GAL4 and GAL60 and 
which contains an integrated lacZ reporter gene under 
control of GALl promoter elements (Feilotter et al.. 
Nucleic Acids Research 22:1502-1503 (1994)). 



An interaction of the AGLB ''bait" and AGL5 
''prey" was demonstrated in the YPB2 strain by the 
development of blue colonies on X-GAL containing media. 
Control ''bait "-"prey" combinations, including the 
GAL4 (1-147) DNA binding domain and GAL4 transcriptional 
activation domain only produced only white colonies. 
These results demonstrate that AGL8 can interact with 
AGL5 in yeast and indicate that the AGLB and AGL5 plant 
MADS box gene products also can interact in seed plants. 

All journal article, reference, and patent 
citations provided above, in parentheses or otherwise, 
30 whether previously stated or not, are incorporated herein 
by reference. 



20 



25 
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Although the invention has been described with 
reference to the examples above, it should be understood 
that various modifications can be made without departing 
from the spirit of the invention. Accordingly, the 
5 invention is limited only by the following claims. 
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SEQUENCE LISTING 



(1) GENERA.L INFORMATION: 

(i) APPLICANT: The Regents of the University of California 

(ii) TITLE OF INVENTION: Seed Plants Characterized by Delayed 
Seed Dispersal 

(iii) NUMBER OF SEQUENCES: 24 

(iv) CORRESPONDENCE ADDRESS: 

{A) ADDRESSEE: Campbell & Fiores LLP 

(B) STREET: 4370 La Jolla Village Drive, Suite 700 

(C) CITY: San Diego 
{D) STATE: California 

(E) COUNTRY: United States 

(F) ZIP: 92122 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING' SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release ^1.0, Version 4*1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

{A) APPLICATION NUMBER: US 60/051,030 
(B) FILING DATE: 27-JUN-1997 

(A) APPLICATION NUMBER: US 09/067,800 

(B) FILING DATE: 28-APR-1998 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: CaiT^pbell, Cathryn A. 

(B) REGISTRATION NUMBER: 31,815 

(C) REFERENCE/DOCKET NUMBER: FP-UD 3188 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (619) 535-9001 

(B) TELEFAX: (619) 535-8949 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(Bl LOCATION: 101.. 827 
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(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION: 1062 

(D) OTHER INFORMATION: /not€= "There is a poly (A) tail at 
the end.** 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..1062 

(D) OTHER INFORMATION: /note= "Nucleotide and Deduced 
Amino Acid Sequences of the AGL8 cDNA clone," 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CCCAGAGAGA CATAAGAAAG AAAGAGAGAG AGAGATACTT TGGTCATTTC AGGGTTGTCG 60 

TTTCTCTCTC TTGTTCTTGA GATTTTGAAG AGAGAGAGAT ATG GGA AGA GGT AGG 11^ 

Met Gly Arg Gly Arg 

1 5 

GTT GAG CTG AAG AGG ATA GAG AAC AAG ATC AAT AGG CAA GTT ACT TTC 163 

Val Gin Leu Lys Arg lie Glu Asn Lys lie Asn Arg Gin Val Thr Phe 

10 15 20 

TCA AAG AGA AGG TCT GGT TTG CTC AAG AAA GCT CAT GAG ATC TCT GTT 211 
Ser Lys Arg Arg Ser Gly Leu Leu Lys Lys Ala His Glu lie Ser Val 

25 30 35 

CTC TGC GAT GCT GAG GTT GCT CTC ATC GTC TTC TCT TCC AAA GGC AAA 259 
Leu Cys Asp Ala Glu Val Ala Leu lie Val Phe Ser Ser Lys Gly Lys 
40 45 50 

CTC TTC GAA TAT TCC ACC GAG TCT TGC ATG GAG AGG ATA CTT GAA CGC 307 
Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu Arg lie Leu Glu Arg 
55 60 65 

TAT GAT CGC TAT TTA TAT TCA GAG AAA CAA CTT GTT GGC CGA GAC GTT 35 5 

Tyr Asp Arg Tyr Leu Tyr Ser Asp Lys Gin Leu Val Gly Arg Asp Val 
70 75 80 85 

TCA CAA ACT GAA AAT TGG GTT CTA GAA CAT GCT AAG CTC AAG GCA AGA 4 03 

Ser Gin Ser Glu Asn Trp Val Leu Glu His Ala Lys Leu Lys Ala Arg 

90 95 100 

GTT GAG GTA CTT GAG AAG AAC AAA AGG AAT TTT ATG GGG GAA GAT CTT 4 51 

Val Glu Val Leu Glu Lys Asn Lys Arg Asn Phe Met Gly Glu Asp Leu 

105 110 115 

GAT TCG TTG AGC TTG AAG GAG CTC CAA AGG TTG GAG CAT CAG CTC GAT 4 99 

Asp Ser Leu Ser Leu Lys Glu Leu Gin Ser Leu Glu His Gin Leu Asp 
120 125 130 

GCA GCT ATC AAG AGC ATT AGG TCA AGA AAG AAC CAA GCT ATG TTC GAA 54 7 

Ala Ala lie Lys Ser He Arg Ser Arg Lys Asn Gin Ala Met Phe Glu 
135 140 145 

TCC ATA TCT GCG CTC CAG AAG AAG GAT AAA GCC TTG CAA GAT CAC AAC 595 
Ser He Ser Ala Leu Gin Lys Lys Asp Lys Ala Leu Gin Asp His Asn 
150 155 160 165 

AAT TCG CTT CTC AAA AAG ATT AAG GAG AGG GAG AAG AAA ACG GGT CAG 64 3 

Asn Ser Leu Leu Lys Lys He Lys Glu Arg Glu Lys Lys Thr Gly Gin 
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170 175 180 

CAA GAA GGA CAA TTA GTC CAA TGC TCC AAC TCT TCT TCA GTT CTT CTG 691 
Gin Glu Gly Gin Leu Val Gin Cys Ser Asn Ser Ser Ser Val Leu Leu 

185 190 . 195 

CCT CAA TAC TGC GTA ACC TCC TCC AGA GAT GGC TTT GTG GAG AGA GTT 7 39 

Pro Gin Tyr Cys Val Thr Ser Ser Arg Asp Gly Phe Val Glu Arg Val 
200 205 210 

GGG GGA GAG AAC GGT GGT GCA TCG TCG TTG ACG GAA CCA AAC TCT CTG 787 
Gly Gly Glu Asn Gly Gly Ala Ser Ser Leu Thr Glu Pro Asn Ser Leu 
215 220 225 

CTT CCG GCT TGG ATG TTA CGT CCT ACC ACT ACG AAC GAG T AGAACTATCT 8 37 
Leu Pro Ala Trp Met Leu Arg Pro Thr Thr Thr Asn Glu 
230 235 240 

CACTCTTTAT AATATAATGA TAATATAATT AATGTTTAAT ATTTTCATAA CATTCAGCAT 8 97 

TTTTTTGGTG ACTTATACTC ATTATTAATA CCGATATGTT TTAGCTAGTC ATATTATATG 95 7 

TATGATGGAA CTCCGTTGTC GAGACGTATG TACGTAAGCT ATCATTAGAT TCACTGCGTC 1017 

TTAAGAuACAA AGATTCATAT CTTGGTAATG ATTTCTCATG AAATA 1062 

(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Arg Gly Arg Val Gin Leu Lys Arg lie Glu Asn Lys He Asn 
15 10 15 

Arg Gin Val Thr Phe Ser Lys Arg Arg Ser Gly Leu Leu Lys Lys Ala 

20 25 30 

His Glu lie Ser Val Leu Cys Asp Ala Glu Val Ala Leu He Val Phe 
35 40 4b 

Ser Ser Lys Gly Lys Leu Phe Glu Tyr Ser Thr Asp Ser Cys Met Glu 
50 55 60 

Arg He Leu Glu Arg Tyr Asp Arg Tyr Leu Tyr Ser Asp Lys Gin Leu 
65 70 75 80 

Val Gly Arg Asp Val Ser Gin Ser Glu Asn Trp Val Leu Glu His Ala 

85 90 95 

Lys Leu Lys Ala Arg Val Glu 'Val Leu Glu Lys Asn Lys Arg Asn Phe 

100 105 110 

Met Gly Glu Asp Leu Asp Ser Leu Ser Leu Lys Glu Leu Gin Ser Leu 
115 120 125 

Glu His Gin Leu Asp Ala Ala He Lys Ser He Arg Ser Arg Lys Asn 
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130 



135 



140 



Gin Ala Met Phe Glu Ser He Ser Ala Leu Gin Lys Lys Asp Lys Ala 
145 150 155 160 
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Leu Gin Asp His 



Lys Lys Thr Gly 

180 

Ser Ser Val Leu 
195 

Phe Val Glu Arg 
210 



Asn Asn Ser Leu 
165 

Gin Gin Glu Gly 



Leu Pro Gin Tyr 

200 

Val Gly Gly Glu 
215 



Leu Lys Lys lie 
170 

Gin Leu Val Gin 
185 

Cys Val Thr Ser 



Asn Gly Gly Ala 

220 



Lys Glu Arg Glu 
175 

Cys Ser Asn Ser 
190 

Ser Arg Asp Gly 
205 

Ser Ser Leu Thr 



Glu Pro Asn Ser Leu Leu Pro Ala Trp Met Leu Arg Pro Thr Thr Thr 
225 230 235 240 

Asn Glu 

(2) INFORMATION FOR SEQ ID WO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE55 : unknown 
fD) TOPOLOGY: unknown 

a -A) FEATURE: 

(A) NAME/KEY: mi3C_feature 

(B) LOCATION: 1 . .5 622 

(D) OTHER INFORMATION: /labels AGLl_promote r 

/note= "Nucleotide sequence of the AGLl promoter." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AGATCTGCA/i. CAGTGAAAAG AGAAAACAAA ATGGACTTGA AGAGGTTTTG ACAATGCCAG 60 

AGATAATGCT TATTCCCTAA TATGTTGCCA GCCAAGTGTC' AAATTGGCTT TTTAAATATG 120 

GATTTCTGTA TCAGTGGTCA TATTTGTGGA TCCAACGTAT TCATCATCAA GTTCTCAAGT 180 

TTGCTTTCAG TGCAATTCTA ATTCACACGT TTTVACTTTAA CATGCATGTC ATTATAATTA 24 0 

CTTCTTCACT AAGACACAAT ACGGCAAACC TTTCAGATTA TATTAATCTC CATAAATGAA 300 

ATAATTAACC TCATAATCAA GATTCAATGT TTCTAAATAT ATATGGACAA AATTTACACG 360 

GAAGATTAGA TACGTATATT AGTAGATTTA GTCTTTCGTT TGTGCGATAA GATTAACCAC 4 20 

CTCATAGATA GTAATATCAT TGTCAAATTC CTCTCGGTTT AGTCGCTAAA TTGTATCTTT 4 80 

TTTAAGCCTA AAAGTAGTGT ATTCGCATAT GACTTATCGT CCTAACTTTT TTTTTAATTA 54 0 

ACAAAAAAAT CGAAAAGAAA ATAATCTGTT AAATATTTTT TAAGTACTCC ATTAAGTTTA 600. 

GTTTCTATTT AAAAAATGCT TGAAJ^.TTTGA CAGTTATGTT CAACAATTTT GAATCATGAG 660 

CGATGTCTAG ATACTCAGAA TTTAATCAAG ATGTCTTATC AAATTTGTTG TCACTCGAGG 720 

ACCCACGCAA AAGAAAAGAC TAATATGATT TTTATTTGGT CTGGATATTT TTGTAGAGGA 78 0 
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TGAAACTAAG AGAGTGAAAG ATTCGAAATC CACAATGTTC AAGAGAGCTC AAAGCAAAAA 84 0 

GAAAAATGAA GATGAAGGAC TAAAGAACAA TAAGCAACTA CTTATACCCT ATTTCCATAA 900 

AGGATTCAGG TACTAGGAGA AGTTGAGGCA AGTTNNNNNN NATTGATTCA AATTTTCATT 960 

TATTTTTACA ATTTAATTCA CCTAAGTTAT TATGCATTTC. TCATCATTGG TACATTTTCT 1020 

GTATAGCGTA TTTACATATA TGAAATAAAT TAAATATGTC CTCACGTTGC AAGTAGTTAA 108 0 

TGAATGTCCC CACGCAAAAA AAAATCCCTC CAAATATGTC CACCTTTTCT TTTCTTTTTA 114 0 

ATTCCAAAAT TACCATAAAC TTTTGGTTTA CAAAAGATTT CTAGAAATTG AGGAAGATAT 1200 

CCTAAATGAT TCATGAATCC TTCAATAATC TGAAGTTTGC GATATTTTCG ATTTTCTTCA 12 60 

AGAGTTGCGA TATTTGTAAT TTGGTGACCT TAAACTTTTT TTGATAAAGA GTAAACGTTT 1320 

TTTCTTAAAA GTAAAACTTG ATTTTATGTT TTAGGGTTCT AGCTCAACTT TGTATTATAT 138 0 

TTCTTGCAAA AAGAGTTCGT TAACTGCATT CTTCAACACT ATAAAGTGAT TATCAAAAAC 14 40 

ATCTTCATGA ACATTAAGAA AAACAATATT TGGTTTCGGT TAGAGCTTGG TTTTGCTTGG 1500 

CTTGATTCAC ATACCCATTC TAGACTTTGG CATAAATTTG ATACGATAGA GAGTATCTAA 1560 

TGGTAATGCA GAAGGGTAAA AAAAGGAAGA GAGAAAAGGT GAGAAAGATT ACCAAAAATA 1620 

AGGAGTTTCA AA.AGATGGTT CTGATGAGAA ACAGAGCCCA TCCCTCTCCT TTTCCCCTTC 1680 

CCATGAAAGA AATCGGATGG TCCTCCTTCA ATGTCCTCCA CCTACTCTTC TCTTCTTTCT 17 4 0 

^^^^^^^^^^ CTTATTATTA ACCATTTAAT TAATTTCCCC TTCAATTTCA GTTTCTAGTT 1800 

CTGT;JWVAG AAAATACACA TCTCACTTAT AGATATCCAT ATCTATTTAT ATGCATGTAT 18 60 

AGAGAATAAA AAAGTGTGAG TTTCTAGGTA TGTTGAGTAT GTGCTGTTTG GACAATTGTT 1920 

AGATGATCTG TCCATTTTTT TCTTTTTTCT TCTGTGTATA AATATATTTG AGCACAAAGA 1980 

AAAACTAATA ACCTTCTGTT TTCAGCAACT AGGGTCTTAT AACCTTCAAA GAAATATTCC 204 0 

TTCAATTGAA AACCCATAAA CCAAAATAGA TATTACAAAA GGAAAGAGAG ATATTTTCAA 2100 

GAACAACATA ATTAGAAAAG CAGAAGCAGC AGTTAAGTGG TACTGAGATA AATGATATAG 2160 

TTTCTCTTCA AGAACAGTTT CTCATTACCC ACCTTCTCCT TTTTGCTGAT CTATCGTAAT 2220 

CTTGAGAACT CAGGTAAGGT TGTGAATATT ATGCACCATT CATTAACCCT AAAAATAAGA 2280 
GATTTAAAAT AAATGTTTCT TCTTTCTCTG ATTCTTGTGT AACCAATTCA TGGGTTTGAT 2 34 0 

ATGTTTCTTG GTTATTGCTT ATCAACAAAG AGATTTGATC ATTATAAAGT AGATTAATAA 24 00 

CTCTTAAACA CACAAAGTTT CTTTATTTTT TAGTTACATC CCTAATTCTA GACCAGAACA 24 60 
TGGATTTGAT CTATTTCTTG GTTATGTATC TTGATCAGGA AAAGGGATTT GATCATCAAG 2520. 
ATTAGCCTTC TCTCTCTCTC TCTAGATATC TTTCTTGAAT TTAGAAATCT TTATTTAATT 2580 
ATTTGGTGAT GTCATATATG GATCAATGGA GGAAGGTGGG AGTAGTCACG ACGCAGAGAG 264 0 

TAGCAAGAAA CTAGGGAGAG GGAAAATAGA GATAAAGAGG ATAGAGAACA CAACAAATCG 27 00 
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TCAAGTTACT TTCTGCAAAC GACGCAATGG TCTTCTCAAG AAAGCTTATG AACTCTCTGT 2760 

CTTGTGTGAT GCCGAAGTTG CCCTCGTCAT CTTCTCCACT CGTGGCCGTC TCTATGAGTA 2820 

CGCCAACAAC AGGTACGCTT CTCCTACTCT ATTTCTTGAT CTTGTTTTCT TAATTTTAAC 288 0 • 

TAAACAAGAT CCTAGTTCAA ATGATAACAA AGTGGGGATT GAGAGCCAAG ATTAGGGTTT 294 0 

GGTTAATTTA GAAAACCAGA TTTCACTTGT TGATACATTT AATATCTCTC TAGCTAGATT 3000 

TAGTACTCTC TCCTCTATAT ATGTGTGGGT GTGTGTGTAA GTGTGTATAT GTATGCAAAT 3060 

GCAAGAAGAA GAAGAAAAAG TTATCTTGTC TTCTCAAATT CTGATCAGCT TTGACCTTAG 3120 

TTTCACTCTT TTTTCTGCAA ATCATTTGAA CCTGATGCAT GTCAGTTTCT ACAATACACT 3180 

TTTAATTTTG ACGGCCCATC AAATTTCCTA GGGTTTACTT CAGTGAACAA AATTGGGTTC 324 0 

TTGACACGAT TTAGCATGTA TATATAAAAA TAGGGGATGA TCAAGACTTA TGTAACCTCT 3300 

GTCTGGTGAA ACTAGGGACA AAGTCTACTG ATGAGTTGTC ACTAGGGATC CATTTGATCA 3360. 

TTTAATCCCA ACAAAAATGA AACAAAATTT TGAGAATTTA TATGCTGAAG TTTTTCAACC 34 20 

CTCTTTTTTA AJ^VTAACTTTA TATTATGTAG ATTTGTATTT AGGGTAATTT GTCCAACTAG 3 4 80 

AAGTCCTAAA AATCAATAAA CACACGGATG ACTTTGTCTA ACATTGTATC AGTCATCAAA 354 0 

TGTAAAATTG TAC.AAATAAT GAAATTAAAG ATTTAGTCTC TTTTATTTTT TTTGTTTAGG 3600 

GTGTATATAT ATATATATAT GTATATTTGT TGCATTGATA TATCAATGAG AGGGAGAGAA 3660 

CTCAGAGAAG TGTCGGAAAT TAAAATGGTA CGAGCCAATT GGAATCTCTG GCATTCTGAG 3720 

CTTCATTTGT TTGTTATTAG AAAAAAAAAA AAAAAATCCT TTAAAGATAC CTTCATGATG 37 80 

ACATTGAATC ATGTAATATA CACGATACAT GGTCTAATTC CTCCTCAAAC CCTAATTACC 38 4 0 

AATTTCGAAA CCATAATATT TACTAGTATG TTTATATATC CTTACTTTAA GACATTGTTT 3900 

GTTTATAATA CCTTGTGAAT TAAGAAAAAA AAAAAAAAAC TTGTGGATCT ATTCAAGCCA 3960 

TGTGTTAGAA TAAATTTATA AATTTTCTCC TCGTACTGGT CAGATATTGG TCCAAACTCC 4020 

AAAGCCTTCC CTTTTCAGGA AAAAAAACAT TTCGAAATTA ACTCTAATTA ATCAAGAATT 4080 

TCCTACAATG TATACATCTA ATGTTTTTTC CGCGATCTTA CTTATTAGTG TGAGGGGTAC 414 0 

AATTGAAAGG TACAAGAAAG CTTGTTCCGA TGCCGTCAAC CCTCCTTCCG TCACCGAAGC 4 200 

TAATACTCAG GTACCT^TTT ATATTGTTTG ATTCTCTTTG TTTTATCTTC TTCTTTTCAT 4 2 60 

TATATATATG ATCAACAAAA AATATAACCT ACAAAAAGAG AGAGTTCAAG GAAATGCATT 4 320 

GAAACGGTTT CGTTATGGTG TTTGAATACA TGGATTTTTG AAGTACTATC AGCAAGAAGC 4 380 

CTCTAAGCTT CGGAGGCAGA TTCGAGATAT TCAGAATTCA AATAGGTAAT TCATTAACTT 4 4 40 

TTCATGAACT CTTCGATTTG GTATTAGGTC ACTTAATTTG GTGTCGGTCC AAAAGTCCGC 4 500 

TTGTAGTTTT CTTTAGAAGT TGTTTTGTTT AATGTTCATG TTTACAAATT GAAGGCATAT 4 560 

TGTTGGGGAA TCACTTGGTT CCTTGAACTT CAAGGAACTC AAAAACCTAG AAGGACGTCT 4 620 
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TGAAAAAGGA 


ATCAGCCGTG 


TCCGCTCCAA 


AAAGGTAAAA 


TCTACGTTGC 


TCTCTCTCTG 


4680 


TGTCTCTGTC 


TCTCTCTCTA 


TATATAGTCC 


CTTAGTTTAT 


ATAGTTCATC 


ACCCTTTTGT 


4740 


GAGAATTTTG 


CAGAATGAGC 


TGTTAGTGGC 


AGAGATAGAG 


TATATGCAGA 


AGAGGGTAAG 


4800 


AACGTTTCTC 


CCATTCCAAG 


TAATTAGATC 


TTTCTTCGTC 


TTTGTGAGGG 


TTTGAGTTTT 


4860 


CCCATAAATC 


ATGTGTAGGA 


AATGGAGTTG 


CAACACAATA 


ACATGTACCT 


GCGAGCAAAG 


4920 


GTTAGCCACG 


TTCTGTTCCA 


AATCTTAATC 


TCAATATCTA 


CTCTTTTCTT 


CATTGTATAA 


4980 


CTAAGATAAC 


GTGAATAACA 


AGAAAACTTT 


TGTTTTTGGG 


TTTAATAGAT 


AGCCGAAGGC 


5040 


GCCAGATTGA 


ATCCGGACCA 


GCAGGAATCG 


AGTGTGATAC 


AAGGGACGAC 


AGTTTACGAA 


5100 


TCCGGTGTAT 


CTTCTCATGA 


CCAGTCGCAG 


CATTATAATC 


GG7U\CTATAT 


TCCGGTGAAC 


5160 


CTTCTTGAAC 


CGAATCAGCA 


ATTCTCCGGC 


CAAGACCAAC 


CTCCTCTTCA 


ACTTGTGTAA 


5220 


CTCAAAACAT 


GATAACTTGT 


TTCTTCCCCT 


CATAACGATT 


AAGAGAGAGA 


CGAGAGAGTT 


5280 


CATTTTATAT 


TTATAACGCG 


ACTGTGTATT 


CATAGTTTAG 


GTTCTAATAA 


TGATAATAAC 


5340 


AAAACTGTTG 


TTTCTTTGCT 


TAATTAGATC 


AACATTTAAA 


TCCAAAGTTC 


TAAAACACGT 


5400 


CGAGATCCAA 


AGTTTGTCAT 


ACAAGATTAG 


ACGCATACAC 


GATCAGTTAA 


TAGATTTTAA 


5460 


GTGCCTTTTA 


ATATTTACAT 


ATAGTTGCAG 


CTTCGATTAG 


ATCATGTCCA 


CCAAACACTC 


5520 


ACAATTAGAG 


ACAAGCAAAA 


CTATAAACAT 


TGATCATT^AA 


ATGATTACAA 


CATGTCCATA 


5580 


AATTAATTAT 


GGATTACAAA 


AATAAAAACT 


TACAAAAGAT 


CT 




5622 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(ix) FEATURE: 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION: 1..6138 

(D) OTHER INFORMATION; /label= AGL5_promot er 

/note= "Nucleot ide sequence of the AGL5 promoter . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GAATTCGTAA CAGAATTTAG TGAATAATAT TGTAATTACC AGGCAAGGAC TCTCCAAACG 60 

GATAGCTCGA ATATCGTTAT TAAAGAGTAA ATGATCCAAT ATGTAAGCCA TTGTTGATCA 120 

TCTAACATTG TTGGACTCTC TATTGCTCGA AATGATGCAT ACCTAATCAT TTATTCAGTT 180 

AACTATCAAG TTGCATTTGT AAAAACCAAA CATTTAAATT CAGATTTGAT ATCACTTACA 24 0 

GAGGATAGAG AAGCATGACT CCAGGCCTGC ATGCAACAAG AAAAAGGAAG AAAATAATGT 300 

TAAAAATTTG ACAAATATAG TGTTTATTTT TATTATATGA GACAGAATTT GAATAAAATC 360 
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CTACCCAACT 


AGAGCATCAA 


AACGTTTTGC 


AATCGCAATA 


ATGAAACCCA 


TTTTCTTTTT 


420 


GAGTTTTTAC 


TCTTCTTTCA 


Tt .^T T*. Tft TV 

ACAGAAACTT 


fT^ 'Vv TV TV. M ^\ 

TCTCPJ\ACGT 


CTTTAGCACT 


GTGACGTTAG 


4 80 


ATATATACAC 


AAAAGCTTGA 


AATTTCTTCA 


TV T\ TV TV TV Ts >s 

AGCAAAAGAA 


TCTTTGTGGG 


AGTTAAGGCA 


^ Jl ^v 

540 


ACAAGCCAGG 


TAAAGAATCT 


CCAACGCATT 


GTTACGTTTT 


CATGAACCTA 


TTTATTATAT 


600 


GTTCTAAG7VA 


AGAAAAAAAT 


ATCTCAAAGT 


TS. TV Tt ^k. 

AAACGTTGGA 


AATTTTCTGA 


TGAAGGGAAA 


660 


TCCAAAGTCT 


TGGGTTTAGT 


Ts fT^ TV TV 

ATCCCTATGA 


ATGGTATTTG 


GAATATGTTT 


TCGTCAAAAC 


720 


AAAAGATTCT 


TTTCTTTTTC 


ACAAGAGTTA 


GTGATCAATA 


ACTTATGCAC 


TAATTAATGA 


780 


GATTGGACGT 


ATACACAATT 


TGATTATGAT 


ACTTGAGTAA 


AAATCACCTG 


TCCTTTAATT 


840 


TGGAAATCTC 


TCTTTCTTAC 


CCATTTATAT 


ACTACTTCTT 


TTCATTAAAA 


TTAAATTTCA 


900 


ATTATCAATC 


ATCGTTCAAT 


TTGATAAAGA 


TTTAACATTT 


TTTGTCACAG 


GGCTAGTAAA 


960 


AGCAATCTTT 


ACATAATTCA 


TCTTTCTTAC 


ATATATATAT 


TACCTTTTTC 


TTCATTAGTA 


1020 


TTCTATTTGA 


TTATGATTAT 


TTTGTCATAA 


AGCTAGTAAA 


TTAAACACTC 


GATATGAGAA 


1080 


TTATATTACT 


TCACGCTAAT 


TAACTCTTAA 


CACAACAAGA 


ACTAGTGCAT 


ATTCAACTTT 


1140 


CAAAGCATAT 


ACTATATATT 


GAGAATATAG 


ACCACGAAAG 


TCAATCAAAA 


GACCTACCAG 


1200 


CTCTCATCAA 


GTTCTTTCTT 


GAAATGATTT 


TGCAGAATTT 


cca;^iCTTaat 


TAATTCGACA 


1260 


TGAATGTGAA 


AATGTGTGTT 


GCTCGTTAAG 


AAAATTGAAT 


AGAAGTAC/v/i 


TGAAAATGAT 


1320 


GAGGAATGGG 


CAAPJKChChA 


AAGAGTTTCC 


TTTCGTAACT 


ACA/iTTAATT 


AATGCAA.ATC 


1380 


TGAGAA.^\GGG 


TTCATGGATA 


ATGACTACAC 


ACATGATTAG 


TCATTCCCCG 


TGGGCTCTCT 


1440 


GCTTTCATTT 


ACTTTATTAG 


TTTCATCTTC 


TCTAATTATA 


TTGTCGCATA 


TATGATGCAG 


1500 


TTCTTTTGTC 


TAAATTACGT 


AATATGATGT 


AATTAATTAT 


CAA.2VATAA.^T 


ATTCAAATTG 


1560 


CCGTTGGACT 


/\ACC TAATGT 


CCAAGATTAA 


GACTTGAACA 


TAAGAATTTT 


GGAAAAACTA 


1620 


AACCAGTTAT 


AATATATACT 


CTTAAATTGC 


CATTTCTGAA 


CACAACCAAA 


TAATAATATA 


168 0 


TACTATTTAC 


AGTTTTTTTT 


AATTGGCAAG 


AACACTGAAA 


TCTTATTCAT 


TGTCTCGCTT 


1740 


GGTAGTTGAC 


AAGTTATAAC 


ACTCATATTC 


TV TV FT^ TV Tv 

ATATAACCCC 


ATTCTAACGT 


TGACGACGAA 


1800 


CACTCAT ATA 


AACCACCCAA 


ATTCTTAGCA 


TATTAGCTAA 


ATATTGGTTT 


AATTGGAAAT 


1860 


ATTTTTTTTA 


T\ TV f V t TV iA. TV Tt 

TATATAAAAT 


# • TV ^V TV 

GCCAGGTAAA 


TATTAACGAC 


ATGCAATGTA 


TATAGGAGTA 


1920 


GGGCAATAAA 


AAGAAAAGGA 


GAATAAAAAG 


GGATTACCAA 


AAAAGGAAAG 


TTTCCAAAAG 


1980 


GTGATTCTGA 


TGAGAAACAG 


AGCCCATACC 


TCTCTTTTTT 


CCTCTAAACA 


TGAAAGAAAA 


2040 


ATTGGATGGT 


CCTCCTTCAA 


TGCTCTCTCC 


CCACCCAATC 


CAAACCCAAC 


TGTCTTCTTT 


2100 


CTTTCTTTTT 


TCTTCTTTCT 


AATTTGATAT 


TTTCTACCAC 


TTAATTCCAA 


TCAATTTCAA 


2160 


ATTTCAATCT 


AAATGTATGC 


ATATAGAATT 


TAATTAAAAG 


AATTAGGTGT 


GTGATATTTG 


2220 


AGAAAATGTT 


AGAAGTAATG 


GTCCATGTTC 


TTTCTTTCTT 


TTTCCTTCTA 


TAACACTTCA 


2280 
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GTTTGAAAAA 


AAACTACCAA 


ACCTTCTGTT 


TTCTGCAAAT 


GGGTTTTTAA 


ATACTTCCAA 


2340 


AGAAATATTC 


CTCTAAAAGA 


AATTATAAAC 


CAAAACAGAA 


ACCAAAAACA 


AAAAATAAAG 


2400 


TTGAAGCAGC 


AGTTAAGTGG 


TACTGAGATA 


ATAAGAATAG 


TATCTTTAGG 


CCAATGAACA 


2460 


AATTAACTCT 


CTCATAATTC 


ATCTTCCCAT 


CCTCACTTCT 


CTTTCTTTCT 


GATATAATTA 


2520 


ATCTTGCTAA 


GCCAGGTATG 


GTTATTGATG 


ATTTACACTT 


TTTTTTAAAA 


GTTTCTTCCT 


2580 


TTTCTCCAAT 


CAAATTCTTC 


AGTTAATCCT 


TATAAACCAT 


TTCTTTAATC 


CAAGGTGTTT 


2640 


GAGTGCAAAA 


GGATTTGATC 


TATTTCTCTT 


GTGTTTATAC 


TTCAGCTAGG 


GCTTATAGAA 


2700 


ATGGAGGCTG 


GTGCGAGTAA 


TGAAGTAGCA 


GAGAGCAGCA 


AGAAGATAGG 


GAGAGGGAAG 


2760 


ATAGAGATAA 


AGAGGATAGA 


GAACACTACG 


AATCGTCAAG 


TCACTTTCTG 


CAAACGACGC 


2820 


AATGGTTTAC 


TCAAGAAAGC 


TTATGAGCTC 


TCTGTCTTGT 


GTGACGCTGA 


GGTTGCTCTT 


2880 


GTCATCTTCT 


CCACTCGAGG 


CCGTCTCTAC 


GAGTACGCCA 


ACAACAGGTA 


CACATCTTTT 


2940 


AGCTAGATCT 


TGATTTTGTT 


G/\ATTTTTTT 


TCTAGAATAA 


AGTTTCGACT 


CTTCTGGTGG 


3000 


GTTTTTCAAT 


CTTTATGGTC 


TCTTTATAGT 


TTTTTTCCTT 


AGTTTCTCTG 


AAGCTCAAAT 


3060 


CTCTTTAAAA 


ATCCCCAA/^ 


TTAGGGTTTG 


TTTAAAACTA 


GGGAACCCTA 


CTTTAACTTC 


3120 


TTTCTCTTAG 


TAAAAAAGCA 


GTGAGGGTCT 


TCTCTGATCA 


TTAATTAGCA 


TCCCCCATAC 


3180 


CTTCTTCCAG 


TCACTTTTTC 


TCCACAAATC 


CTTATAACAG 


TATCTATATA 


TGTATCTATT 


3240 


TATGTCACTT 


TGTACAAGAC 


ACTTCGATCA 


ATTTGATGAC 


CCATCAAGTT 


TTATTTCTGC 


3300 


AGATTGATCA 


TTAGGTTTCC 


ATCATAGTAA 


TGAAAAAGTA 


GGGTTCTTGA 


TAAAATTATA 


3360 


ATAATATATA 


TTATTTGGCT 


ATATAAAAAA 


GCTATGTAGA 


TTCCTTAAAA 


ATTGATTCAC 


3420 


TAGGGAGAGA 


CTAGTAGGTG 


TTTGTCTTCT 


GACACTTCTC. 


TAATCTTTTG 


GTGAATCCTT 


3480 


TTGTTAAATC 


AAGAAAATGA 


ATCAGGGACA 


AAGCTTATTG 


TTGAGTCACT 


TAATTAATCA 


3540 


TCCGATCCAT 


CAATCAAGAA 


AAATAACGAA 


ACAGAAAATT 


TTGATTTTTG 


ATTGTTATTT 


3600 


TCTCCACTTC 


AAGTTGGGGA 


CTTGTCATTT 


CCGTTTTTCT 


ATACGTTTCC 


AGCTATTAAC 


3660 


AGCTCATGTT 


CATTTCACCA 


TTTTGATTAT 


TTGTCTGCTT 


TTTAAAGATA 


AATGTTTTCA 


3720 


AAAATATTGT 


TTTTATTTGC 


TTGGCTAGTT 


AATACTATAA 


TTGAGGTTGA 


TGTATGACTA 


3780 


TAATCTATAA 


GTCAAGTCTC 


ATATCATGGA 


TCTAAGTTAA 


AACTAGTAAA 


TTTGTAGTTT 


38 4 0 


CAATGTGAAC 


TTTCACAACG 


ACTAAAGAAC 


TGATCTGAAG 


TTTATAATGG 


ACATGACTAA 


3900 


TTTGATTAAC 


AAAAGAGGAA 


TGCATTATGT 


ATGTAGAAAC 


ATGTGATATA 


TATATGTTTC 


3960 


TATTATCAAA 


AGTGTAGTTA 


ACTTTCTTAT 


TTCAAACACC 


CTCATGCTTT 


AGTAGTATCT 


4020 


TACTTTTGAC 


ATTTCTCAAC 


TTCAGCTTTC 


CATTATACAA 


CAGCACAATG 


TAAATTACTT 


4080 


GTATATGAAT 


ATGAAAGCAT 


AACGTTATGC 


AAAGATTTCT 


AGCTTTTCTT 


TTTCTGTTTT 


4140 


GC7W\AGATT 


TACAAATATC 


ATGTTCTTGG 


TAAAAACATA 


CTTGCCTCAG 


CCACATATGC 


4200 
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ATGTAAATGT AATGTTCAAA TATTAATTCA GGAAAAACAA AGAAGAAGCA AAATTAGCTT 4 2 60 

CTAGAGTAGG GAATCTATTG ACTTGACCTG AAAATCACTT CTTTTTCTTA AAGCCTAGTA -3 320 

GTGAATTTTT TAATCTAATT AGGCCAAAAT ATATACTAGC CTAAAATATA ATTTGGATTT 4 380 

TGTGTCGTAC ATAAATTGGG ACCAATTCCA ATTAACTAAG AGCATATGCA ATTCAAATTC 4 4 40 

TTTTTATTTT CTTCTCCGAT TTGCTACTTC TTTCTTTTGT ATGTTTTCAA ATTAGGATTA 4 500 

CACTTTTTTG GGGAAGTACA CATTAGGGTC TTCTCGAACT TTGATTATAC ATATATATAT 4 560 

ATATATATAT ATATAACTTT GTGAGATGTC ACTGTTAATA GATAATAGGC AATAACAATA 4 620 

ATATCCAAAA AAGAAGGCGC AAACAAATCA TATACTATAT GGTACTGGTC CATTCACTAT 4 680 

TTTGTCGGTT GAATTTAAGG TTTGGCGTAC AAACTTTGTT TCAAACCTTT ATTATTCCGT 4 74 0 

CTTTCTGTGT GTTTTGTATA TCCAGAAGAT AAAAATATCA ATTTCTTTAA CGACTTCATA 4 800 

TAT AT AT ATA TATATATATA TAT AT AT ATT TTTCTCTTCT GGTTTTAGTG TTTGAATCCA 4 8 60 

ACAGTTATAG TTTCGTGTGT CTTTGTTTTA CTTGTGGTGG TTTAAGTTTG AGATTTTCAC 4 92 0 

CGATTGCATC TATTTACATA TATAGCTACC ACAAAAAAGA TTGCATTTTA AAATCTTTTC 4 980 

CTTTGTGTGA ATGTTGATGA AGTGTGAGAG GAACAATAGA AAGGTACAAG AAAGCTTGCT 504 0 

CCGACGCCGT TAACCCTCCG ACCATCACCG AAGCTAATAC TCAGGTTAGC TTTTAATTAA 5100 

TACACCTAGC TAGCTAGTTC GTTAATTACT TAATTTCTTC TTCTTTTAGT TATCTGACCT 5160 

TTTTTTCACC TCTTGTAACA ATGATGGGAT CGAAATTGAT GAAGTACTAT CAGCAAGAGG 5220 

CGTCTAAACT CCGGAGACAG ATTCGGGACA TTCAGAATTT GAACAGACAC ATTCTTGGTG 5280 

AATCTCTTGG TTCCTTGAAC TTTAAGGAAC TCAAGAACCT TGAAAGTAGG CTTGAGAAAG 534 0 

GAATCAGTCG TGTCCGATCC AAGAAGGTAC ATCACTAACT CTCCATCAAT CTCCTTATCA 54 00 

TTGAATATAT ATCCATCTGA TTCTTGCCCG TTATATTTGG TTTTTCTCTC CAGCACGAGA 54 60 

TGTTAGTTGC AGAGATTGAA TACATGCAAA AAAGGGTAAA AGTAAAACCT ATCTTCCTTC 5520 

ACAATGAACT ACCCCTACTT TATTAGCAAC TTCTCTTTCT GATGATCATC TTTTTTATTT 5580 

TCTGTTGTCG CTTGCATTGT AGGAAATCGA GCTGCAAAAC GATAACATGT ATCTCCGCTC 564 0 

CAAGGTTTTA TACATAACTC TTTTTGGCAT TTTTGATCAT CATTTTTTTC CGGTAGACAA 5700 

TCTCTTGATG TGCAAATTCT AAATATCTCT GCAGATTACT GAAAGAACAG GTCTACAGCA 57 60 

ACAAGAATCG AGTGTGATAC ATCAAGGGAC AGTTTACGAG TCGGGTGTTA CTTCTTCTCA 5820 

CCAGTCGGGG CAGTATAACC GGAATTATAT TGCGGTTAAC CTTCTTGAAC CGAATCAGAA 588 0 

TTCCTCCAAC CAAGACCAAC CACCTCTGCA ACTTGTTTGA TTCAGTCTAA CATAAGCTTC 594 0 

TTTCCTCAGC CTGAGATCGA TCTATAGTGT CACCTAAATG CGGCCGCGTC CCTCAACATC 6000 

TAGTCGCAAG CTGAGGGGi^A CCACTAGTGT CATACGAACC TCCAAGAGAC GGTTACACAA 6060 

ACGGGTACAT TGTTGATGTC ATGTATGACA ATCGCCCAAG TAAGTATCCA GCTGTGTTCA 6120 
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GAACGTACGT CCGAATTC ^^^^ 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 base pairs 

(B) TYPE: nucleic acid 

(C) STEUVNDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7.. 7 53 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 896 

(D) OTHER INFORMATION: /note= "There is a poly (A) tail at 
the end of the cDNA sequence." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION : 1 . .896 

(D) OTHER INFORMATION: /note= "AGLl cDNA and deduced 
protein sequences." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GGATCA ATG GAG GAA GGT GGG AGT AGT CAC GAC GCA GAG AGT AGC AAG 

Met Glu Glu Gly Gly Ser Ser His Asp Ala Glu Ser Ser Lys 
1 5 10 



48 



AAA CTA GGG AGA GGG AAA ATA GAG ATA AAG AGG ATA GAG AAC ACA ACA 96 
Lys Leu Gly Arg Gly Lys He Glu He Lys Arg He Glu Asn Thr Thr 
15 ' 20 25 30 

AAT CGT CAA GTT ACT TTC TGC AAA CGA CGC AAT GGT CTT CTC AAG AAA 14 4 

Asn Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys 

35 40 45 

GCT TAT GAA. CTC TCT GTC TTG TGT GAT GCC GAA GTT GCC CTC GTC ATC 192 
Ala Tyr Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala Leu Val He 

5,0 55 60 

TTC TCC ACT CGT GGC CGT CTC TAT GAG TAC GCC AAC AAC AGT GTG AGG 24 0 

Phe Ser Thr Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asn Ser Val Arg 
65 70 75 

GGT ACA ATT GAA AGG TAC AAG AAA GCT TGT TCC GAT GCC GTC AAC CCT 28 8 

Gly Thr He Glu Arg Tyr Lys Lys Ala Cys Ser Asp Ala Val Asn Pro 
80 85 90 

CCT TCC GTC ACC GAA GCT AAT ACT CAG TAC TAT CAG CAA GAA GCC TCT 336 
Pro Ser Val Thr Glu Ala Asn Thr Gin Tyr Tyr Gin Gin Glu Ala Ser 
95 100 105 HO 
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AAG CTT CGG AGG CAG ATT CGA GAT ATT CAG hP.T TCA AAT AGG CAT ATT 384 
Lys Leu A.rq Arg Gin lie Arg Asp lie Gin Asn Ser Asn Arg His lie 

lis 120 125 

GTT GGG GAA TCA CTT GGT TCC TTG AAC TTC AAG GAA CTC AAA AAC CTA 4 32 

Val Gly Glu Ser Leu Giy Ser Leu Asn Phe Lys Glu Leu Lys Asn Leu 

130 135 140 

GAA GGA CGT CTT GAA AAA GGA ATC AGC CGT GTC CGC TCC AAA AAG AAT 4 80 

Glu Gly Arg Leu Glu Lys Gly lie Ser Arg Val Arg Ser Lys Lys Asn 
145 150 155 

GAG CTG TTA GTG GCA GAG ATA GAG TAT ATG CAG AAG AGG GAA ATG GAG 528 
Glu Leu Leu Val Ala Glu lie Glu Tyr Met Gin Lys Arg Glu Met Glu 
160 165 170 

TTG CAA CAC AAT AAC ATG TAG CTG CGA GCA AAG ATA GCC GAA GGC GCC 57 6 

Leu Gin His Asn Asn Met Tyr Leu Arg Ala Lys Tie Ala Glu Gly Ala 
175 180 185 190 

AGA TTG A/\T CCG GAC CAG CAG GAA TCG AGT GTG ATA CAA GGG ACG ACA 624 
Arg Leu Asn Pro Asp Gin Gin Glu Ser Ser Val lie Gin Gly Thr Thr 

195 200 ' 205 

GTT TAC GAP-. TCC GGT GTA TCT TCT CAT GAC CAG TCG CAG CAT TAT AAT 672 
Val Tyr Glu Ser Giy Val Scr Ser His Asp Gin Ser Gin His Tyr Asn 

210 215 220 

CGG AAC TAT ATT CCG GTG AAC CTT CTT GAA CCG AAT CAG ChP. TTC TCC 7 20 

Arg Asn Tyr lie Pro Vu L Asn Leu Leu Glu Pro Asn Gin Gin Phe Ser 
225 230 235 

GGC CAA GAC CA,^ CCT CCT CTT CAA CTT GTG TAACTCAAAA CATGATAACT 7 70 

Gly Gin Asp Gin Pro Pro Leu Gin Leu Val 
240 245 

TGTTTCTTCC CCTCATAACG ATTAAGAGAG AGACGAGAGA GTTCATTTTA TATTTATAAC 8 30 

GCGACTGTGT ATTCATAGTT TAGGTTCTAA TAATGATAAT AACAA.AACTG TTGTTTCTTT 8 90 

GCTTCA 896 

(2) INFORH^^TTON FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24B amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Glu Glu Gly Gly Ser Ser His Asp Ala Glu Ser Ser Lys Lys Leu 
1 .5 10 . 15 

Gly Arg Gly Lys lie Glu lie Lys Arg lie Glu Asn Thr Thr Asn Arg 

20 25 30 

Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala Tyr 
35 40 45 
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Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala Leu Val lie Phe Ser 
50 55 60 

Thr Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asn Ser Val Arg Gly Thr 
65 70 75 80 

lie Glu Arg Tyr Lys Lys Ala Cys Ser Asp Ala Val Asn Pro Pro Ser 

85 90 95 

Val Thr Glu Ala Asn Thr Gin Tyr Tyr Gin Gin Glu Ala Ser Lys Leu 

100 105 110 

Arg Arg Gin lie Arg Asp He Gin Asn Ser Asn Arg His He Val Gly 
115 120 125 

Glu Ser Leu Gly Ser Leu Asn Phe Lys Glu Leu Lys Asn Leu Glu Gly 
130 135 140 

Arg Leu Glu Lys Gly He Ser Arg Val Arg Ser Lys Lys Asn Glu Leu 

150 155 160 

Leu Val Ala Glu He Glu Tyr Met Gin Lys Arg Glu Met Glu Leu Gin 

165 170 175 

His Asn Asn Mot Tyr Leu Arg Ala Lys He Ala Glu Gly Ala Arg Leu 

180 185 190 

Asn Pro Asp Gin Gin Glu Ser Ser Val He Gin Gly Thr Thr Val Tyr 
1^5 200 205 

Glu Ser GJy Val Ser Ser His Asp Gin Ser Gin His Tyr Asn Arq Asn 
210 215 220 

Tyr He Pro Val Asn Leu Leu Glu Pro Asn Gin Gin Phe Ser Gly Gin 
225 230 235 240 

Asp Gin Pro Pro Leu Gin Leu Val 



245 



(2) INFOPJ^TIOH FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 959 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY 

(B) LOCATION 



CDS 

78 . .818 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1 . , 95 9 

CD) OTHER INFORMATION: /note 
protein sequences," 



= "AGL5 cDNA and deduced 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 
GAATTCATCT TCCCATCCTC ACTTCTCTTT CTTTCTGATC ATAATTAATC TTGCTAAGCC 60 

AGCTAGGGCT TATAGAA ATG GAG GGT GGT GCG AGT AAT GAA GTA GCA GAG 110 

Met Glu Gly Gly Ala Ser Asn Glu Val Ala Glu 
1 5 10 

AGC AGC AAG AAG ATA GGG AGA GGG AAG ATA GAG ATA AAG AGG ATA GAG 158 
Ser Ser Lys hys He Gly Arg Gly Lys He Glu He Lys Arg He Glu 

15 20 25 

AAC ACT ACG AAT CGT CAA GTC ACT TTC TGC AAA CGA CGC AAT GGT TTA 206 
Asn Thr Thr Asn Arg Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu 
30 35 40 

CTC AAG AAA GCT TAT GAG CTC TCT GTC TTG TGT GAC GCT GAG GTT GCT 2 54 

Leu Lys Lys Ala Tyr Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala 
45 50 55 



CTT GTC ATC TTC TCC ACT CGA GGC CGT CTC TAG GAG TAC GCC AAC AAC 
Leu Val Ho Pho Ser Thr Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asn 
60 65 70 75 



GTT AAC CCT CCG ACC ATC ACC GAA GCT AAT ACT CAG TAC TAT GAG CAA 
Val Asn Pro Pro Thr He Thr Glu Ala Asn Thr Gin Tyr Tyr Gin Gin 

^5 100 10^. 



AGA CAC ATT CTT GGT GAA TCT CTT GGT TCC TTG AAC TTT AAG GAA CTC 
Arg His He Leu Gly Glu Ser Leu Gly Ser Leu Asn Phe Lys Glu Leu 
125 130 135 



GAA AGA ACA GGT CTA CAG CAA CAA GAA TCG AGT GTG ATA CAT CAA GGG 
Glu Arg Thr Gly Leu Gin Gin Gin Glu Ser Ser Val He His Gin Gly 
190 195 200 



302 



AGT GTG AGA CGA ACA ATA GAA AGG TAC AAG AAA GCT TGC TCC GAC GCC 350 
Ser Veil Arq Gly Thr He Glu Arg Tyr Lys Lys Ala Cys Ser Asp Ala 

80 85 90 



398 



GAG GCG TCT AAA CTC CGG AGA CAG ATT CGG GAC ATT CAG AA.T TTG AAC 44 6 

Glu Ala Ser Lys Leu Arg Arg Gin He Arg Asp He GJ n Asn Leu Asn 
110 115 120 



A9A 



AAG AAC CTT GAA AGT AGG CTT GAG AAA GGA ATC AGT CGT GTC CGA TCC 54 2 

Lys Asn Leu Glu Ser Arg Leu Glu Lys Gly He Ser Arg Val Arg Ser 

140 145 150 155 

AAG AAG CAC GAG ATG TTA GTT GCA GAG ATT GAA TAC ATG CAA AAA AGG 5 90 

Lys Lys His GJu Met Leu Val Ala Glu He Glu Tyr Met Gin Lys Arg 

160 165 170 

GAA ATC GAG CTG CAA AAC GAT AAC ATG TAT CTC CGC TCC AAG ATT ACT 638 

Glu He Glu Leu Gin Asn Asp Asn Met Tyr Leu Arg Ser Lys He Thr 

175 180 185 



686 



ACA GTT TAC GAG TCG GGT GTT ACT TCT TCT CAC CAG TCG GGG CAG TAT 7 34 

Thr Val Tyr Glu Ser Gly Val Thr Ser Ser His Gin Ser Gly Gin Tyr 
205 210 215 
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AAC CGG AAT TAT ATT GCG GTT AAC CTT CTT GAA CCG AAT CAG AAT TCC 7 82 

Asn Arg Asn Tyr lie Ala Val Asn Leu Leu Glu Pro Asn Gin Asn Ser 
220 225 230 235 

TCC AAC CAA GAG CAA CCA CCT CTG CAA CTT GTT TGATTCAGTC TAACATAAGC 8 35 

Ser Asn Gin Asp Gin Pro Pro Leu Gin Leu Val 

240 245 

TTCTTTCCTC AGCCTGAGAT CGATCTATAG TGTCACCTAA ATGCGGCCGC GTCCCTCAAC 8 95 

ATCTAGTCGC AAGCTGAGGG GAACCACTAG TGTCATACGA ACCTCCAAGA GACGGTTACA 955 

CAAA 95 9 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Glu Gly Gly Ala Ser Asn Glu Val Ala Glu Ser Ser Lys Lys lie 
1 5 10 15 

Gly Arg Gly Lys lie Glu lie Lys Arg lie Glu Asn Thr Thr Asn Arg 

20 25 30 

Gin Val Thr Phe Cys Lys Arg Arg Asn Gly Leu Leu Lys Lys Ala Tyr 
35 40 45 

Glu Leu Ser Val Leu Cys Asp Ala Glu Val Ala Leu Val lie Phe Ser 
50 55 60 

Thr Arg Gly Arg Leu Tyr Glu Tyr Ala Asn Asn Ser Val Arg Gly Thr 
65 10 75 80 

lie Glu Airg Tyr Lys Lys Ala Cys Ser Asp Ala Val Asn Pro Pro Thr 

85 90 95 

He Thr Glu Ala Asn Thr Gin Tyr Tyr Gin Gin Glu Ala Ser Lys Leu 

100 105 110 

Arg Arg Gin lie Arg Asp He Gin Asn Leu Asn Arg His He Leu Gly 
115 120 125 

Glu Ser Leu Gly Ser Leu Asn Phe Lys Glu Leu Lys Asn Leu Glu Ser 
130 135 140 

Arg Leu Glu Lys Gly He Ser Arg Val Arg Ser Lys Lys His Glu Met 
145 150 155 160 

Leu Val Ala Glu He Glu Tyr Met Gin Lys Arg Glu He Glu Leu Gin 

165 170 175 

Asn Asp Asn Met Tyr Leu Arg Ser Lys He Thr Glu Arg Thr Gly Leu 

180 185 • 190 
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Gin Gin Gin Glu Ser Ser Val lie His Gin Gly Thr Val Tyr Glu Ser 
195 200 205 

Gly Val Thr Ser Ser His Gin Ser Gly Gin Tyr Asn Arg Asn Tyr He 
210 215 220 

Ala Val Asn Leu Leu Glu Pro Asn Gin Asn Ser Ser Asn Gin Asp Gin 
225 230 235 240 

Pro Pro Leu Gin Leu Val 

245 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ix) FEATURE: 

(A) NAME/KEY: mi s c_f ea t ure 

(B) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note= "Primer AGL8 5-4" 



t>:i) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCGTCGACGA TGGGAAGAGG TAGGGTT 27 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1. .20 

(D) OTHER INFORMATION: /note- "Primer OAM14 . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AATCATTACC AAGATATGAA 20 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CGGATAGCTC GAATATCG 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AACATTGCGT CGTTTGC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2A base pairs 

(B) TYPE : nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GTAATTACCA GGC/"^GGACT CTCC 
(2) INFORMATION FOR SEQ ID N0:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

GTCATCGGCG GGGGTCATAA CGTG 

(2) INFORMATION FOR SEQ ID NO: IS: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAGGATAGAG AACACTACGA ATCG 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAGGTCAAGT CAATAGATTC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Mi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CAGAATTTAG TGAATAATAT TG 
(2) INFOHr^TION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARPlCTERI ST I CS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 8 : 
GCCAGAGATA ATGCTATTCC 
(2) INFORMATION FOR SEQ ID NO : 1 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CATTGATCCA TATATGACAT CAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GTGATGTCAT ATATGGATCA ATGGGAAGAG GTAGGGTTCA G 4 1 

(2) INFORMATION FOR SEQ ID N0:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CAAGAGTCGG TGGAATATTC G 21 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHAR^\CTERI STTCS : 

(A) LENGTH: 4 1 base pairs 

(B) TYPE: nucleic acid 

(C) STRAl^DEDNESS : Single 

(D) TOPOLOGY: linear 



(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CGAATATTCC ACCGACTCTT GGTACGCTTC TCCTACTCTA T 4 1 

(2) INFORMATION FOR SEQ TO NO: 23: 

(i) SEQUENCE CHARACTFHT GTICS : 

(A) LENGTH: 2 1 base pairs 
(R) TYPE: nuclcjc acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTAATAAGTA AGATCGCGGA A 21 
(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TTCCGCGATC TTACTT^TTA GCATGGAGAG GATACTTGAA C . 41 
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We claim: 

1. A non-naturally occurring seed plant, 
comprising an ectopically expressed nucleic acid molecule 

encoding an AGL8-like gene product, said seed plant 
5 characterized by delayed seed dispersal. 



2. The non-naturally occurring seed plant of 
claim 1, wherein said AGL8-like gene product has 
substantially the amino acid sequence of an AGL8 
ortholog . 

10 3. The non-naturally occurring seed plant of 

claim 2, wherein said AGL8-like gene product has the 
amino acid sequence of Arabidopsis AGL8 (SEQ ID N0:2). 



4. The non-naturally occurring seed plant of- 
claim 3, which is a transgenic seed plant, 

15 5. The transgenic seed plant of claim 4, 

wherein said ectopically expressed nucleic acid molecule 
encoding an AGL8-like gene product is operatively linked 
to an exogenous regulatory element. 



6. The transgenic seed plant of claim 5, 
20 wherein said exogenous regulatory element is a 
constitutive regulatory element. 



7. The transgenic seed plant of claim 6, said 
nucleic acid molecule comprising an exogenous nucleic 
acid molecule encoding substantially the amino acid 
25 sequence of an AGL8 ortholog operatively linked to a 
cauliflower mosaic virus 35S promoter. 
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8. 



The transgenic seed plant of claim 6, 



wherein said exogenous regulatory element is a dehiscence 
zone-selective regulatory element. 



5 wherein said dehiscence zone-selective regulatory element 
is selected from the group consisting of an AGLl 
regulatory element and an AGL5 regulatory element. 



wherein said nucleic acid molecule encoding an AGL8-like 
10 gene product is an exogenous nucleic acid molecule 

encoding substantially the amino acid sequence of an AGL8 
ortholog . 

11. The transgenic seed plant of claim 10, 
wherein said AGLB-like gene product has the amino acid 

15 sequence of Arabidopsis AGLB {SEQ ID NO:2). 

12. The transgenic seed plant of claim 9, 
wherein said dehiscence-zone selective regulatory element 
is an AGLl regulatory element comprising at least fifteen 
contiguous nucleotides of a nucleotide sequence selected 

20 from the group consisting of: 



9. 



The transgenic seed plant of claim 8, 



10. The transgenic seed plant of claim 3, 



nucleotides 1 to 2599 of SEQ ID N0:3; 



nucleotides 2833 to 4128 of SEQ ID N0:3; 



nucleotides 4211 to 4363 of SEQ ID NO:3; 



nucleotides 4426 to 4554 of SEQ ID N0:3; 



25 



nucleotides 4655 to 4753 of SEQ ID M0:3; 



nucleotides 4796 to 4878 of SEQ ID N0:3; 



nucleotides 4921 to 5028 of SEQ ID N0:3; and 



nucleotides 5421 to 5682 of SEQ ID NO: 3. 
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13. The transgenic seed plant of claim 9, 
wherein said dehiscence-zone selective regulatory element 
is an AGL5 regulatory element comprising at least fifteen 
contiguous nucleotides of a nucleotide sequence selected 
5 from the group consisting of: 

nucleotides 1 to 1888 of SEQ ID N0:4; 

nucleotides 2928 to 5002 of SEQ ID N0:4; 

nucleotides 5085 to 5204 of SEQ ID N0:4; 

nucleotides 5367 to 5453 of SEQ ID N0:4; 
10 nucleotides 5496 to 5602 of SEQ ID N0:4; 

nucleotides 5645 to 5734 of SEQ ID N0:4; and 

nucleotides 6062 to 6138 of SEQ ID N0:4. 



14. The non-naturally occurring seed plant of 
claim 1, which is a dehiscent seed plant. 

15 15, The non-naturally occurring seed plant of 

claim lA , which is a member of the Brassicaceae , 

16. The non-naturally occurring seed plant of 
claim 14, which is a member of the Fabaceae. 



17. A non-naturally occurring seed plant, in 
20 which AGLl expression and AGL5 expression each are 

suppressed, said seed plant characterized by delayed seed 
dispersal . 



18. The non-naturally occurring seed plant of 
claim 17, which is an agll agl5 double mutant. 
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19. A tissue derived from a non-naturally 
occurring seed plant, said seed plant comprising an 
ectopically expressible nucleic acid molecule encoding an 
AGL8-like gene product and characterized by delayed seed 
dispersal . 

20. The tissue of claim 19, which is a seed. 

21- A tissue derived from a non-naturally 
occurring seed plant, in which AGLl expression and AGL5 
expression each are suppressed, said seed plant 
characterized by delayed seed dispersal - 

22. The tissue of claim 21, which is a seed. 

23. A method of producing a non-naturally 
occurring seed plant characterized by delayed seed 
dispersal, comprising ectopically expressing a nucleic 
acid molecule encoding an AGL8-like gene product in said 
seed plant, whereby seed dispersal is delayed due to 
ectopic expression of said nucleic acid molecule. 

24 . A substantially purified dehiscence 
zone-selective regulatory element, comprising a 
nucleotide sequence that confers selective expression 
upon an operatively linked nucleic acid molecule in the 
valve margin or dehiscence zone of a seed plant, 

provided that said dehiscence zone-selective 
regulatory element does not have a nucleotide sequence 
consisting of nucleotides 1889 to 2703 of SEQ ID N0:4. 
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25. The substantially purified dehiscence 
zone-selective regulatory element of claim 24, which is 
selected from the group consisting of an AGLl regulatory 
element and an AGL5 regulatory element. 



5 26. The substantially purified dehiscence 

zone-selective regulatory element of claim 25, which is 
an AGLl regulatory element comprising at least fifteen 
contiguous nucleotides of a nucleotide sequence selected 
from the group consisting of: 
10 nucleotides 1 to 2599 of SEQ ID N0:3; 

nucleotides 2833 to 4128 of SEQ ID N0:3; 
nucleotides 4211 to 4363 of SEQ ID N0:3; 
nucleotides 4426 to 4554 of SEQ ID N0:3; 
nucleotides 4655 to 4753 of SEQ ID N0:3; 
15 nucleotides 4796 to 4878 of SEQ ID N0:3; 

nucleotides 4921 to 5028 of SEQ ID N0:3; and 
nucleotides 5361 to 5622 of SEQ ID N0:3. 

27. The substantially purified dehiscence 
zone-selective regulatory element of claim 25, which is 
20 an AGL5 regulatory element comprising at least fifteen 

contiguous nucleotides of a nucleotide sequence selected 
from the group consisting of: 

nucleotides 1 to 1888 of SEQ ID N0:4; 

nucleotides 2928 to 5002 of SEQ ID N0:4; 
25 nucleotides 5085 to 5204 of SEQ ID N0:4; 

nucleotides 5367 to 5453 of SEQ ID ^30:4; 

nucleotides 5496 to 5602 of SEQ ID N0:4; 

nucleotides 5645 to 5734 of SEQ ID NO: 4; and 

nucleotides 6062 to 6138 of SEQ ID NO: 4. 

30 28 . A plant expression vector, comprising a 

dehiscence zone-selective regulatory element. 
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29. A kit for producing a transgenic seed 
plant characterized by delayed seed dispersal, comprising 
a dehiscence zone-selective regulatory element having a 
nucleotide sequence that confers selective expression 
5 upon an operatively linked nucleic acid molecule in the 
valve margin or dehiscence zone of a seed plant, 

provided that said dehiscence zone^selective 
regulatory element does not have a nucleotide sequence 
consisting of nucleotides 1889 to 2703 of SEQ ID NO: 4. 

10 30- The kit of claim 29, said dehiscence 

zone-selective regulatory element is operatively linked 
to a nucleic acid molecule encoding an AGL8-like gene 
product . 
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CCCAGAGAGACATAAGAAAGAAAGAGAGAGAGAGATACTT 
TGGTCATTTCAGGGTTGTCGTTTCrCTCTCTTGTTCrTGAGATTT'l'GAAGAGAGAGAGAT 
1 ATGGGAAGAGGTAGGGTTCAGCTGAAGAGGATAGAGAACAAGATCAATAGGCAAGTTACT 
1 M G R GRVOLKRI E NKINROVT 

6 1 TTCTCAAAGAGAAGGTCTGGTTTGCrCAAGAAAGCTCATGAGATCTCrG^ 

21 FSKRRSGLLKKAHEI SVL CD 

121 GCTGAGGTTGCTCTCATCGTCTTCTCTTCCAAAGGCAA^ 
41 AEVALIVFSSKGKLFEY S T D 

181 TCTTGCATGG AGAGGATACTTGAACGCTATGATCGCTATTTATATTCAGACAAACZUV^ 
61SCMERI LERYDRYLYSDKQL 

241 GTTGGCCGAGACGTTTCACAAAGTGAAA?VTTGGGTTCTAGAACATGCrrAAGCTCAAGGCA 
81VGRDVSQSENW VL EHAKLKA 

301 AGAGTTGAGGTACTTGAGAAGAACAAAAGGAATTTTATGGGGGAAGATCTTGATTCGTTG 
101 RVEVLEKNKRNFMGEDLDSL 

3 61 AGCTTGAAGGAGCTCCAAAaCTTGGAGCATCAGCTCGATGCAGCTATCAAGAGCATTAGG 
121 SLKELOSLEHOLDAAIKSIR 

421 TCAAGAAAGAACCAAGCTATGTTCGAATCCATATCTGCGCTCCAGAAGAAGGATAAAGCC 
141 SRKNOAMFESI SAL.OKKD K A 

4 81 TTGCAAGATCACAACAATTCGCTTCTCAAAAAGATTAAGGAGAGGGAGAAGAAAACGGGT 
161 LQDHNNSLLKKI KEREKKTG 

541 CAGCAAGAAGGACAATTAGTCCAATGCTCCAACTCTTCTTCAGTTCTTCTGCCTCAATAC 
181 QQEGQLVQCSNSSSVLIiPQY 

6 01 TGCGTAACCTCCTCCAGAGATGGCTTTGTGGAGAGAGTTGGGGGAGAGAACGGTGGTGCA 
201 CVTSSRDGFVERVGGENGGA 

661 TCGTCGTTGACGGAACCAAACTCTCTGCTTCCGGCrTGGATGTTACGTCCTACCACrAC^ 
221 SSLTEPNSLiLPAWMLRPTTT 

721 AACGAGTAGAACTATCrCACTCTTTATAATATAATGATAATATAATTAATGTTTAATAT^ 

241 N E * 

781 TTCATAAC;VTTCAGCATTTTTTTGGTGACTTATACTCATTATTAATACC 

841 GCTAGTCATATTATATGTATGATGGAACTCCGTTGTCGAGACGTATGTACGTAAGCTATC 

901 ATTAGATTCACTGCGTCTTAAGAACAAAGATTCATATCITGGTAATGATTTCrrC^ 

961 TAn 

FIG. 6 
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AGATCTGCAA CAGTGAAAAG AGAAAACAAA ATGGACTTGA AGAGGTriTC ACAATCCCAG 

120 

* * * ♦ 

AGATAATGCT TATTCCCTAA TATGTTGCCA GCCAAGTGTC AAATTCGCTT TTTAAATATC 

180 

GATTTCTGTA TCAGTGGTCA TATTTGTGGA TCCAACGTAT TCATCATCAA GTItnCAAGT 

240 

' * * * * 

TrecrrrcAG tgcaattcta attcacacgt rrAAcrrrAA catgcatctc attataatta 

* . , ^ ^ 300 

CTTCTTCACT AAGACACAAT ACGGCAAACC TTTCAGATTA TATTAATCTC CATAAATCAA 

360 

* * ♦ ♦ 

ATAATTAACC TCATAATCAA GATTCAATGT TTCTAAATAT ATATCGACAA AATTTACACG 

420 

GAAGATTAGA TACGTATATT AGTAGATTTA GTCTTTCGTT TCTGCGATAA GATTAACCAC 

480 
* 

CTCATAGATA GTAATATCAT TGTCAAATTC CTCTCGGTTT AGTCGCTAAA TTGTATCTTT ' 

540 

•i-i-i'AAGCCTA AAAGTAGTCT ATTCGCATAT GACTTATCGT CCTAACTTTT TTTTTAATTA 

600 

* * ♦ * 

ACAAAAAAAT CGAAAAGAAA ATAATCTGTT AAATATTTTT TAAGTACTCC ATTAAGTTTA 

660 

* * * ♦ ♦ * 

GTTTCTATTT AAAAAATGCT TGAAATTTGA CAGTTATCTT CAACAATTTT GAATCATX3AG 

720 

* * * ♦ * * 

CGATCTCTAG ATACTCAGAA TTTAATCAAG ATGTCTTATC AAATTTCTTG TCACTCGAGG 

780 

ACCCACGCAA AAGAAAAGAC TAATATCATT TTTATTTGGT CTGGATATTT TTGTAGAGGA 

840 

♦ * * 

TGAAACTAAG AGAGTGAAAG ATTCGAAATC CACAATCTTC AAGAGAGCTC AAAGCAAAAA 

900 

GAAAAATGAA GATGAAGGAC TAAAGAACAA TAAGCAACTA CTTATACCCT ATTTCCATAA 
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* * * * 

AGGATTCAGG TACTAGGAGA AGTTGAGGCA AGTTNNNNNN NATTGATTCA AATTTTCATT 

TATTTTTACA ATITAArrCA CCTAAGTTAT TATGCATTTC TXIZATCATTCG TACATTTTCT 

GTATAGCGTA TTTACATATA TGAAATAAAT TAAATATXTTC CTCACGTrcC AAGTAGTTAA 

1140 

* * ♦ • 

TGAATGTCCC CACGCAAAAA AAAATCCCTC CAAATATGTC CACCTTITCT TTTCTTITrA 

1200 

ATTCCAAAAT TACCATAAAC TrTTGGTTTA CAAAAGATTT CTAGAAATTO AGGAAGATAT 

1260 

CCTAAATGAT TCATGAATCC TTCAATAATC TGAAGTTTGC GATATTTTCG ATTTTCTTCA 

1320 

♦ 

AGAGTTGCGA TATTTGTAAT TTGGTGACCT TAAACTTTTT TTGATAAAGA GTAAACGTTT 

1380 

^^^^ ****** 
TTTCTTAAAA GTAAAACTTG ATTTTATGTT TTAGGGTTCT AGCTCAACTT TGTATTATAT 

1440 

* ♦ * 

* ♦ * ♦ 

TTCTTGCAAA AAGAGTTCGT TAACTGCATT CTTCAACACT ATAAAGTGAT TATCAAAAAC 

1500 

* * ♦ * ♦ * 

ATCTTCATGA ACATTAAGAA AAACAATATT TGGTTTCGGT TAGAGCTTGG TTTTGCTTCG 

1560 

* ♦ * 

***** 

CTTGATTCAC ATACCCATTC TAGACTTTGG CATAAATTTG ATACGATAGA GAGTATCTAA 

1620 

****** 

TGGTAATGCA GAAGGGTAAA AAAAGGAAGA GAGAAAAGGT GAGAAAGATT ACCAAAAATA 

1680 

****** 

AGGAGTTTCA AAAGATGGTT CTGATGAGAA ACAGAGCCCA TCCCTCTCCT TTTCCCCTTC 

1740 

* * * * * * 

CCATGAAAGA AATCGGATGG TCCTCCTTCA ATGTCCTCCA CCTACTCTTC TCTTCTTTCT 

1800 

***♦♦♦ 

TTTTTTCTTT CTTATTATTA ACCATTTAAT TAATTTCCCC TTCAATTTCA GTTTCTAGTT 

1860 
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CTGTAAAAAG APJ\hTACACA TCTCACTTAT AGATATCCAT ATCTATTTAT ATGCATGTAT 

1920 

* ♦ ♦ * ♦ * 

AGAGAATAAA AAAGTGTGAG TTTCTAGGTA TGTTGAGTAT GTGCTGTTTG GAGAArrGTT 

1980 

♦ ****■ * 

AGATGATCTG TCCATTTTTT TCTTTTTTCT TCTGTGTATA AATATATTTG AGCACAAAGA 

2040 

***** * 

AAAACTAATA ACCTTCTGIT TTCAGCAACT AGGGTCTTAT AACCTTCAAA GAAATATTCC 

2100 

***** * 

TTCAATTGAA AACCCATAAA CCAAAATAGA TATTACAAAA GGAAAGAGAG ATATTTTCAA 

2160 

***** * 

GAACAACATA ATTAGAAAAG CAGAAGCAGC AGTTAAGTGG TACTGAGATA AATGATATAG 

2220 

*•♦*** 

TTTCTCTTCA AGAACAGITT CTCA1TACCC ACCTTCTCCT TTTTGCTGAT CTATCGTAAT 

2280 

• ♦ ♦ ♦ * * 

CTTGAGAACT CAGGTAAGGT TGTGAATATT ATGCACCATT CATTAACCCT AAAAATAAGA 

2340 

♦ •♦*** 

GAITTAAAAT AAATGTTTCT TCTTTCTCTG ATTCTTGTGT AACCAATTCA TGGGTTTGAT 

2400 

* * ♦ ♦ ♦ * 

ATGTTTCTTG GTTATTGCTT ATCAACAAAG AGATTTGATC ATTATAAAGT AGATTAATAA 

2460 

****** 

CTCTTAAACA CACAAAGTTT CTTTATTTTT TAGTTACATC CCTAATTCTA GACCAGAACA 

2520 

****** 

TGGATITGAT CTATTTCTTG GTTATGTATC TTGATCAGGA AAAGGGATTT GATCATCAAG 

2580 

****** 

ATTAGCCTTC TCTCTCTCTC TCTAGATATC TITCTTGAAT TTAGAAATCT TTATTTAATT 

translation 2640 
start . 



ATTTGGTGAT GTC AT ATATG _GATC^^GA GGAAGGTGGG AGTAGTCACG ACGCAGAGAG 

2700 

* * " . * * * * 

TAGCAAGAAA CTAGGGAGAG GGAAAATAGA GATAAAGAGG ATAGAGAACA CAACAAATCG eXOn 1 

2760 

* * * 

• * * • * 

TCAAGTTACT TTCTGCAAAC GACGCAATGG TCTTCTCAAG AAAGCTTATG AACTCTCTGT 
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* ♦ ♦ * « * 

CTTGTGTGAT GCCGAAGTTG CCCTCGTCAT CTTCTCCACT CXTTGGCCGTC TCTATGAGTA 

2880 

* * ♦ * ♦ * ■ 

CGCCAACAAC A^TACGCTT CTCCTACTCT ATTTCTTGAT CTTCTTTTCT TAATTTTAAC 

2940 

* ♦ • * ♦ ♦ 

TAAACAAGAT CCTAGITCAA ATGATAACAA AGTGGGGATT GAGAGCCAAG ATTAGGGTTT 

3000 

* * * * ★ * 

GGTTAATTTA GAAAACCAGA TTTCACTTGT TGATACATTT AATATCTCTG TAGCTAGATT 

3060 

* * « * * * 

TAGTACTCTC TCCTCTATAT ATGTGTGGGT GTGTGTGTAA GTGTGTATAT GTATGCAAAT 

3120 

* • * ♦ ♦ * 

GCAAGAAGAA GAAGAA?^G TTATCTTGTC TTCTCAAATT CTGATCAGCT TTGACCTTAG 

3180 

* ♦ ♦ * ♦ ♦ 

TTTCACTCTT TTTTCTGCAA ATCATTTGAA CCTGATGCAT GTCAGTTTCT ACAATACACT 

3240 

* * * * ♦ * 

TTTAATTTTG ACGGCCCATC AAATTTCCTA GGGTTTACTT CAGTGAACAA AATTGGGTTC 

3300 

* • * * ♦ * 

TTGACACGAT TTAGCATGTA TATATAAAAA TAGGGGATGA TCAAGACTTA TGTAACCTCT 

3360 

* • * * • * 

GTCTGGTGAA ACTAGGGACA AAGTCTACTG ATGAGTTGTC ACTAGGGATC CATTTGATCA 

3420 

•* * ♦ ♦ ♦ * 

TTTAATCCCA ACAAAAATGA AACAAAATTT TGAGAATTTA TATGCTGAAG nTTTCAACC 

3480 

* * * * * ♦ 

CTCTTrriTA AATAACTTTA TATTATGTAG ATTTGTATTT AGGGTAATTT GTCCAACTAG 

3540 

* * ♦ ♦ « * 

AAGTCCTAAA AATCAATAAA CACACGGATG ACTTTGTCTA ACATTGTATC AGTCATCAAA 

3600 

* • * ♦ ♦ * 

TGTAAAATTG TACAAATAAT GAAATTAAAG ATTTAGTCTC TTTTATTTTT TTTGTTTAGG 

3660 

* * ♦ * ■ * * 

GTGTATATAT ATATATATAT GTATATTTGT TGCATTGATA TATCAATGAG AGGGAGAGAA 

3720 

A * * ^ * * 



FIG. 7D 



SUBSTITUTE SHEET (RULE 26) 

BMSDOCID <WO S#900S02A1 I > 



wo 99/00502 1 1 / 20 PCT/US98/13208 

CTCAGAGAAG TGTCGGAAAT TAAAATGGTA CGAGCCAATT GGAATCTCTG GCATTCTGAG 

3780 

* * * * ♦ * 

CTTCATTTGT TTGTTATTAG AAAAAAAAAA AAAAAATCCT TTAAAGATAC CTTCATGATG 

3840 

* * * ♦ * * 

ACATTGAATC ATGTAATATA CACGATACAT GGTCTAATTC CTCCTCAAAC CCTAATTACC 

3900 

****** 

AATTTCGAAA CCATAATATT TACTAGTATG TTTATATATC CTTACTTTAA GACATTGrTT 

3960 

****** 

GTTTATAATA CCTTGTGAAT TAAGAAAAAA AAAAAAAAAC TTGTGGATCT ATTCAAGCCA 

4020 

****** 

TGTGTTAGAA TAAATTTATA AATTTTCTCC TCGTACTGGT CAGATATTGG TCCAAACTCC 

4080 

* * * * ♦ * 

AAAGCCTTCC CTTTTCAGGA AAAAAAACAT TTCGAAATTA ACTCTAATTA ATCAAGAATT 

4140 

* * * * * * 

TCCTACAATG TATACATCTA ATGTTTTTTC CGCGATCTTA CTTATTAGfrG TGAGGGGTAC 



4200 

****** ^w^n o 

exon 2 

AATTGAAAGG TACAAGAAAG CTTGTTCCGA TGCCGTCAAC CCTCCTTCCG TCACCGAAGC 

4260 



TAATACTCAG 



GTACCAATTT ATATTGTTTG ATTCTCTTTG ITITATCTTC TTCTTTTCAT 

4320 

******* 

TATATATATG ATCAACAAAA AATATAACCT ACAAAAAGAG AGAGTTCAAG GAAATGCATT 

4380 

* * ♦ * ^ * * 

GAAACGGTTT CGTTATGGTG TTTGAATACA TGGATTTTTG AAGjTACTATC AGCAAGAAGC 

4440 €xon 3 

* ♦ * ♦ * ♦ 

CTCTAAGCTT CGGAGGCAGA TTCGAGATAT TCAGAATTCA AATA^TAAT TCATTAACTT 

4500 

* * * *^ * * 

TTCATGAACT CTTCGATTTG GTATTAGGTC ACTTAATTTG GTGTCGGTCC AAAAGTCCGC 

4560 

* * * « * * 

TrGTAGTTTT CTTTAGAAGT TGTTTrGTTT AATGTTCATG TTTACAAATT GAACS^ATAT 

4620 exon 4 

* " • • * * * 

TGTTGGGGAA TCACTTGGTT CCTTGAACTT CAAGGAACTC AAAAACCTAG AAGGACGTCT 

FIG. 7E 
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* ♦ ♦ * * * 

TGAAAAAGGA ATCAGCCGTG TCCGCTCCAA AAA^TTAAAA TCTACGTTGC TCTXTTCTCTG 




TGTCTCTGTC TCTCTCTCTA TATATAGTCC CTTAGTTTAT ATAGTTCATC AGO 

4800 

♦ ♦ * ♦ ♦ . * 

GAGAATTTTG CAG^TGAGC TGTTAGTGGC AGAGATAGAG TATATGCAGA AGAGGpTAAG exon 5 

4860 

* ♦ * * ♦ ♦ 

AACGTTTCTC CCATTCCAAG TAATTAGATC TITCTTCGTC TTTGTGAGGG TTTGAGmT 

4920 

***** * 

CCCATAAATC ATGTGTAG3A AATGGAGTTG CAACACAATA ACATGTACCT GCGAGCAAAGj ©XOn 6 

4980 

* • * * ♦ * 

GTTAGCCACG TTCTGTTCCA AATCTTAATC TCAATATCTA CICTTTTCTT CATTGTATAA 

5040 

♦ * * * 

CTAAGATAAC GTGAATAACA AGAAAACTTT TGTTTTTGGG TTTAATAgKt AGCCGAAGGC 

5100 ■ 



GCCAGATTGA ATCCGGACCA GCAGGAATCG AGTGTGATAC AAGGGACGAC AGTTTACGAA 

5160 

♦ * ♦ * * * 

TCCGGTGTAT CTTCTCATGA CCAGTCGCAG CATTATAATC GGAACTATAT tccggtgaac 



^^^^ stop 
^2^9 codon 



CTTCTTGAAC CGAATCAGCA ATTCTCCGGC CAAGACCAAC CTCCTCTTCA ACTTGTG tTAA 

5280 

****** 

CTCAAAACAT GATAACTTGT TTCTTCCCCT CATAACGATT AAGAGAGAGA CGAGAGAGTT 

5340 

CATTITATAT TTATAACGCG ACTGTGTATT CATAGTTTAG GTTCTAATAA TGATAATAAC 

5400 

****** 

AAAACTGTTG TTTCTTTGCT TAATTAGATC AACATTTAAA TCCAAAGTTC TAAAACACGT 

5460 

****** 

CGAGATCCAA AGTTTGTCAT ACAAGATTAG ACGCATACAC GATCAGTTAA TAGATTTTAA 

5520 

♦ *♦*♦* 

GTGCCTTTTA ATATTTACAT ATAGTTGCAG CTTCGATTAG ATCATGTCCA CCAAACACTC 

5580 

^ * * 
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ACAATTAGAG ACAAGCAAAA CTATAAACAT TGATCATAAA ATGATTACAA CATCTCCATA 

* ♦ * ♦ 

AATTAATTAT GGATTACAAA AATAAAAACT TACAAAAGAT CT 

FIG. 7G 
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Sequence Range: 1 to 6138 1^/20 



10 

* 


* 


-* 


40 


50 


60 


GAATTCGTAA 


CAGAATTTAG 




TGTAAi lACC 


AGGCAAGGAC 


TCTCCAAACG 


70 

•* 


80 

* 


on 


* 


110 


120 


GATAGCTCGA 


ATATCGTTAT 




ATGATCCAAT 


•k 

A'lXJ'i'AAGCCA 


* 

TTGTrGATCA 


130 
* 


140 


1 *^n 

* 


160 

-* 


170 


180 


TCTAACATTG 


TrGGAGT^PTT' 


1 Ai IVjC 1\_VjA 


AATGATGCAT 


* 

ACCTAATCAT 


-* 

1TATTCAGTT 


190 

-A- 


w \j 


OTA 
* 


220 
* 


230 


240 


AACTATCAAG 




AAAAACCaAA 


CATTTAAATT 


-* 

CAGA'lTIXiAT 


* 

ATCACITACA 


* 


^ du 
* 


^ ^ 

270 


280 
* 


290 


300 






CCAGGCCTGC 


ATGCAACAAG 


* 

AAAAAGGAAG 


* 

AAAATAATGT 


310 

~J ^ 

* 


J z u 

<* 


330 
* 


340 


350 


360 


TAAAAATTTG 


A riR. A A T A T A r' 


i\ji'X'lA'l'l'i'l' 


TATTATATGA 


■* 

GACAGAA'rrr 


* 

GAATAAAATC 


370 
♦ 


J O w 
•* 


"> o r\ 

390 
*- 


400 


410 


420 


CTACCCAACT 




AALOjI i 1 ixjc 


AATCGCAATA 


* 

ATGAAACCCA 


■4c 


430 




♦ 


460 
* 


470 


480 


GAGnrrTTAc 




Av— AtjtAAAL. i'l" 


TCTCAAACGT 


■* 

Ci'lTAGCACT 


GTGACGTTAG 


490 
^ J \j 

* 


^nn 
-* 


510 


520 
* 


530 


540 


ATATATACAC 


AAA Ar^OTT^P" Ii 


AAl 1 rCTTCA 


AGCAAAAGAA 


* 

TC'l'l'l^JlXJGG 


AU'l'l'AAGGCA 


550 

w ^ w 




c T rv 


^ r% 

580 
* 


590 


600 


ACAAGCCAGG 


A A A n A A TV^T^ 


CCAACGCATT 


GTTACGTTTT 


* 

CATGAACCTA 


-* 

TITATTATAT 


U X v 


* 


630 
* 


640 


650 


660 


GTTCTAAGAA 


Ar;A A A A A RBT* 


ATCTCAAAGT 


AAACGTTGGA 


AATTTTCTGA 


TGAAGGGAAA 


* 


-* 


690 


700 
* 


710 


720 




IvAjOI jTrAGT 


ATCCCTATGA 


ATGGTAmXi 


* 

GAATATGTTT 


-* 

TCGTCAAAAC 


Tin 

* 


/4 0 


750 


760 
* 


770 


780 


AAAAGATtXrr 


'I'i'i^^'i'i'iTix: 


ACAAGAGTTA 


GTGATCAATA 


* 

ACTTATGCAC 


* 

TAATTAATGA 


790 


800 
-* 


810 


820 

■* 


830 


840 


GATTGGACGT 


ATACACAATT 


TGATTATGAT 


ACl'lXJAGTAA 


AAATCACCTG 


TCCTTTAATT 


850 

«■ 


860 


870 
* 


880 
• 


890 


900 


TGGAAATCTC 




CCATTTATAT 


ACTACl'lXri'r 


* 

TTCATTAAAA 


* 

TTAAA'rriXJA 
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910 920 930 940 950 960 

* * * * * * 

ATTATCAATC ATCGTTCAAT TTGATAAAGA TTTAACATTT TTTGTCACAG GGCTAGTAAA 

970 980 990 1000 1010 1020 

* ♦ ★ * ♦ * 

AGCAATCTTT ACATAATTCA TCTTTCTTAC ATATATATAT TACCTTTITC TTCATTAGTA . 

1030 1040 1050 1060 1070 1080 

* * ★ ★ * ★ 

TTCTATTTGA TTATGATTAT TTTGTCATAA AGCTAGTAAA TTAAACACTC GATATGAGAA 

1090 1100 1110 1120 1130 1140 

* * * * ★ * 

TTATATTACT TCACGCTAAT TAACTCTTAA CACAACAAGA ACTAGTGCAT ATTCAACTTT 

1150 1160 1170 1180 1190 1200 

* * * * * * 

CAAAGCATAT ACTATATATT GAGAATATAG ACCACGAAAG TCAATCAAAA GACCTACCAG 

1210 1220 1230 1240 1250 1260 

* * * * * * 

CTCTCATCAA GTTCTTTCTT GAAATGATTT TGCAGAATTT CCAACTTAAT TAATTCGACA 

1270 1280 1290 1300 1310 1320 

« * ^ ■* * * 

TGAATGTGAA AATGTGTGTT GCTCGTTAAG AAAATTGAAT AGAAGTACAA TGAAAATGAT 

1330 1340 1350 1360 1370 1380 

* * * •* ■ * * 

GAGGAATGGG CAAAACACAA AAGAGITTCC TTTCGTAACT ACAATTAATT AATGCAAATC 

1390 1400 1410 1420 1430 1440 

* * ♦ * ■ * 

TGAGAAAGGG TTCATGGATA ATGACTACAC ACATGATTAG TCATTCCCCG TGGGCTCTCT 

1450 1460 1470 1480 1490 1500 

* * ♦ * * ♦ 

GCTTTCATTT ACTTTATTAG TTTCATCTTC TCTAATTATA TTGTCGCATA TATGATGCAG 

1510 1520 1530 1540 1550 1560 

* * ★ * * * 

TTCrnTGTC TAAATTACGT AATATGATGT AATTAATTAT CAAAATAAAT ATrCAAATTG 

1570 1580 1590 1600 1610 1620 

* * * * * * 

CCGTTGGACT AACCTAATGT CCAAGATTAA GACTTGAACA TAAGAATTTT GGAAAAACTA 

1630 1640 1650 1660 1670 1680 

* * ♦ * ♦ * 

AACCAGTTAT AATATATACT CTTAAATTGC CATTTCTGAA CACAACCAAA TAATAATATA 

1690 1700 1710 1720 1730 1740 

■* * ♦ ♦ * ♦ 

TACTATTTAC AGTTTTTTTT AATTGGCAAG AACACTGAAA TCTTATTCAT TGTCTCGCTT 

1750 1760 1770 1780 1790 1800 

* ♦ * * * * 

GGTAGTTGAC AAGTTATAAC ACTCATATTC ATATAACCCC ATTCTAACGT TGACGACGAA 

1810 1820 1830 1840 1850 1860 

* ♦ ♦ ♦ ♦ * 
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CACTCATATA AACCACCXrAA ATTCTTAGCA TATTAGCTAA ATATTCGTTT AATTOGAAAT 

1870 1880 1890 1900 1910 1920 
* * * * ♦ « 

ATrrnrrrA tatataaaat gccaggtaaa tattaacgac atccaatcta tatacsgagta 

1930 1940 1950 I960 1970 1980 

* * • * 

GGGCAATAAA AAGAAAAGGA GAATAAAAAG CSGATTACCAA AAAAGGAAAG TTTCCAAAAG 



1990 2000 2010 2020 2030 2040 

***** 

GTGATTCTGA TGAGAAACAG AGCCCATACC TCTCTTnTT CCTCTAAACA TCAAAGAAAA 
2050 2060 2070 2080 2090 2100 

* * * * ^ 

ATTGGATGGT CCTCCTTCAA TGCTCTCTCC CCACCCAATC CAAACCCAAC lV,'l\,'ll'L.' i U- r 

2110 2120 2130 2140 2150 2160 

* * * * * • 

crrrcmTT tcttctttct aatttgatat TrrcTAccAC ttaattccaa tcaatttcaa 

2170 2180 2190 2200 2210 2220 

* * * ♦ 

ATTTCAATCT AAATGTATGC ATATAGAATT TAATTAAAAG AATTAGGTCT GTCATATTTG 
2230 2240 2250 2260 2270 2280 

* * * 

AGAAAATGTT AGAAGTAATG GTCCATGTTC riTCTTTCTT TTTCCTrCTA TAACACTTCA 
2290 2300 2310 2320 2330 2340 

***** 

GTTTGAAAAA AAACTACCAA ACCTrCTGTT TTCTGCAAAT GGGTITTTAA ATACTTCCAA 

2350 2360 2370 2380 2390 2400 

* ♦ * 

AGAAATATTC CTCTAAAAGA AATTATAAAC CAAAACAGAA ACCAAAAACA AAAAATAAAG 

2410 2420 2430 2440 2450 2460 

* * . . 
TTGAAGCAGC AGTTAAGTGG TACTGAGATA ATAAGAATAG TATCTTTAGG CCAATCAACA 

2470 2480 2490 2500 2510 2520 

* • * * * 

AATTAACTCT CTCAT^TTC ATCTTCCCAT CCTCACTTCT CTTTCTrrcT GATATAATTA 

2530 2540 2550 2560 2570 2580 ^ 

* —1 * * * * * 
ATCTTGCTAA GCCA^STATG GTTATTGATG ATTTACACIT TTnTTAAAA GTITCTTCCT 

2590 2600 2610 2620 2630 2640 

* ♦ * * *'WT.vr 

TTTCTCCAAT CAAATTCTTC AGTTAATCCT TATAAACCAT TTCTrrAA-rc CAAGGTXSTIT 
2^50 2660 2670 2680 2690 2700 



GAGTGCAAAA GGATTTGATC TATTrCTCTT GTGTTTATAC TTCAGCTAGG G&TATAGAA 

translation 

start 2^1° 2720 2730 2740 2750 2760 

^3AG0GTC GTCCGACTAA TGAACTAGCA GAGAGCAGCA AOAAmXAGG GAGAGOGAA^ 

FIG. 8C 
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2770 2780 2790 2800 2810 2820 

****** 

ATAGAGATAA AGAGGATAGA GAACACTACG AATCGTCAAG TCACTTTCTG CAAACGACGC 

2830 2840 2850 2860 2870 2880 

***** * 

AATGGTTTAC TCAAGAAAGC TTATGAGCTC TCTGTCTTGT GTGACGCTGA GGTTGCTCTT 

2890 2900 2910 2920 2930 2940 
* * * * * * 

GTCATCTTCT CCACTCGAGG CCGTCTCTAC GAGTACGCCA ACAACAGpTA CACATCTITT 

2950 2960 2970 2980 2990 3000 

****** 

AGCTAGATCT TGATTTTGTT GAAmTTTT TCTAGAATAA AGTITCGACT CTTCTGGTGG 

3010 3020 3030 3040 3050 3060 

****** 

GTmrCAAT CTTTATGGTC TCTTTATAGT TTTnTCCTT AGTTTCTCTG AAGCTCAAAT 

3070 3080 3090 3100 3110 3120 

****** 

CTCTTTAAAA ATCCCCAAAA TrAGGGTTTG TTTAAAACTA GGGAACCCTA CTTTAACTTC 

3130 3140 3150 3160 3170 3180 

* * * * * ♦ 

TTTCTCriTAG TA/^AAAAGCA GTGAGGGTCT TCTCTGATCA TTAATTAGCA TCCCCCATAC 

3190 3200 3210 3220 3230 3240 

* * * * * 

CTTGTTCCAG TCACTTTITC TCCACAAATC CTTATAACAG TATCTATATA TGTATCTATT 

3250 3260 3270 3280 3290 3300 

***** ♦ 

TATGTCAGTT TGTACAAGAC ACTTCGATCA ATTTGATGAC CCATCAAGTT TTATTTCTGC 

3310 3320 3 330 3340 3350 3360 

****** 

AGATTGATCA TTAGGTTTCC ATCATAGTAA TGAAAAAGTA GGGTTCTTGA TAAAA1TATA 

3370 3380 3390 3400 3410 3420 

******* 

ATAATATATA TTATTTGGCT ATATAAAAAA GCTATGTAGA TTCCTTAAAA ATTGATTCAC 

3430 3440 3450 3460 3470 3480 

***** * 

TAGGGAGAGA CTAGTAGGTG TTTGTCTTCT GACACTTCTC TAATCTTTTG GTGAATCCTT 

3490 3500 3510 3520 3530 3540 

***** * 

TTGTTAAATC AAGAAAATGA ATCAGGGACA AAGCTTATTG TTGAGTCACT TAATTAATCA 

3550 3560 3570 3580 3590 3600 

***** * 

TCCGATCCAT CAATCAAGAA AAATAACGAA ACAGAAAATT TTGATTTTTG ATTGTTATTT 

3610 3620 3630 3640 3650 3660 

***** * 

TCTCCACTTC AAGITGGGGA CTTGTCATTT CCGTTTTTCT ATACGTITCC AGCTATTAAC 

3670 3680 3690 3700 3710 3720 

***** * 
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AGCTCATGTT CATTTCACCA TriTGATrAT TTTAAAGATA AATCTrTTCA 

3730 3740 3750 3760 3770 3780 

AAAATATTCT TTTrATTTCC TTGC3CTAGTT AATACTATAA ITCAGGITCA TCTAIGACtI 
3790 3800 3810 3820 3830 3840 

* * * * 

TAATCTATAA GTCAAGTCTC ATATCA-TOGA TCTAAGTTAA AACTAGTAAA TrTOTAGTIT 
3850 3860 3870 3880 3890 3900 

^ ^ ^L- 

CAATC3TGAAC TTTCACAACG ACTAAAGAAC TGATCTQAAG TTTATAATCG ACATCACTAA 
351° 3920 3930 3940 3950 3960 

* * * * 

TTTGATTAAC AAAAGAGGAA TGCATTATGT ATGTAGAAAC ATCTCATATA TATATOriTC 
35"^° 3980 3990 4000 4010 4020 

TATTATCAAA AGTGTAGTTA ACnTCTTAT TTCAAACACC CTCATCCTrl- AGTAGTATci 
*°30 4040 4050 4060 4070 4080 

TAcrrrTCAC atttctcaac rrcAGcrrrc cattatacaI cagcacaatc TAAATTAcrJ- 

^°5° 4100 4110 4120 4130 4140 

GTATATGAAT ATGAAAGCAT AACGTTATGC AAAGATTRrT AGCnTTCTT TTTCTCTnT 

^1^° ''IfiO 4170 4180 4190 4200 

GCAAAAGATT TACAAATATC ATGTrCTTGG TAAAAACATA CTrcCCTCAG CCACATATCC 

4220 4230 4240 4250 4260 

ATGTAAATGT AATGTTCAAA TATTAATTCA GGAAAAACaI AGAAGAAGcI AAATTAGCT^ 

42"^° 4280 4290 4300 4310 4320 

ctagagtagg gaatctattg acttgacctg aaaatcactt- crnrrcrrA aagcctagta 

^330 4340 4350 4360 4370 4380 

GTGAATrTTT TAATCTAATT AGGCCAAAAT ATATACTAG^ CTAAAATAtI AirTOGATI^ 

4390 4400 4410 4420 4430 4440 

TGTGTCGTAC ATAAATTGGG ACCAATTCCA ATTAACTAAG AGCATATCCA ATTCAAATTC 

4460 4470 4480 4490 4500 

TmTATnr cttctccgat rrccTAcn^ TTKn-mx^ ATCrmcAl attaggattI 

"1? 4520 4530 4540 4550 4560 

cAcmrrrc gggaagtaca cattagggt^ ttctcgaact tksattatac atatatatat 

"''^ 4580 4590 4600 4610 4620 

ATATATATAT ATATAACTTT GTGAGATGT^ ACTGTTAATA GATAATAGG^ AATAACAAtI 
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* * 

ATATCCAAAA AAGAAGGCGC 

4690 4700 

* « 

rrTGTCGGTT GAATTTAAGG 

4750 4760 

* * 

CTTTCTGTGT GTnTGTATA 
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4650 4660 
* * 

AAACAAATCA TATACTATAT 
4710 4720 
TTTGGCGTAC AAACTTTGTT 



4670 4680 

♦ * 

GGTACTGGTC CATTCACTAT 

4730 4740 

* * 

TCAAACXmr ATTATTCCGT 



4770 
* 



4780 



4810 4820 

TATATATATA TATATATATA 

4870 4880 
* * 

ACAGTTATAG TTTCGTGTGT 



TCCAGAAGAT AAAAATATCA 

4830 4840 

* ♦ 

TATATATATT TTTCTCTTCT 

4890 4900 

* * 

CTTTGTTITA CTTGTGGTX3G 



4790 4800 
♦ * 

ATTrCTTTAA CGACTTCATA 



4850 4860 

* * 

GGTnTAGTG TTTGAATCCA 

4910 4920 

* ♦ 

TTTAAGTTTG AGATTTTCAC 



4930 



4940 



4950 



4960 



4970 



4980 



CGATIGCATC TATTTACATA TATAGCTACC ACAAAAAAGA TTOCATrrrA AAATCTTTTC 



4990 



5000 



5010 



5020 



5030 



5040 



CrrTGTGTGA ATGTTGATGA AGPCTGAGAG GAACAATAGA AAGGTACAAG AAAGCTTCCT 



5050 



5060 



5070 



5080 



5090 



5100 



exon 3 



CCGACGCCGT TAACCCTCCG ACCATCACCG AAGCTAATAC 'K:AG3TrAGC TnTAATTAA 



5110 



5120 



TACACCTAGC TAGCTAGTTC 



5130 5140 
* * 

GTTAATTACT TAATTTCTTC 



5150 



5160 



TTCTTTTAGT TATCTGACCT 



5170 



5180 



5190 



5200 



5210 



5220 



rrrrrrcAcc tcttgtaaca atgatgggat cgaaattgat 



5230 



5240 



5250 



CGTCTAAACT CCGGAGACAG 



5260 

ATTCGGGACA nCAGAATTT 



GAAG^CTAT CAGCAAGAGG 

5270 5280 
♦ * 

GAACAGACAC ATTCTTGGTG 



5290 



5300 



AATCTCTTGG TTCCTTGAAC 



5310 5320 
* * 

TTTAAGGAAC TCAAGAACCT 



5330 



5340 exon 4 



TGAAAGTAGG CTTGAGAAAG 



5350 



5360 



GAATCAGTCG TGTCCGATCC 



AAGAA^ 



5370 
AGpTAC 



5410 



5420 



1TGAATATAT AlCCATCTGA 



5380 
ATCACTAACT 

5430 5440 
* * 

TTCTTGCCCG TTATATTTCG 



5390 



5400 



CTCCATCAAT CTCCTTATCA 



5450 



5460 



TTTTICTCTC CACbACGAGA 



5470 




5480 



5490 



5500 



5510 



5520 exon 5 



TGrrAGTTGC AGAGATTGAA TACATGCAAA AAAGgIgtAAA AGTAAAACCT ATCTTCCITC 



5530 



5540 



5550 



5560 



5570 



5580 
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ACAATGAACT ACCCCTACTT TATTAGCAAC TTCTCTTTCT GATGATCATC TTTTTTATTT 

5590 5600 5610 5620 5630 5640 

TCTGTTGTCG CTTGCATTGT AG^AA'TCGA GCTGCAAAAC GATAACATGT ATCTCCGCTC 

5650 5660 5670 5680 5690 5700 exon 6 

_ • • * ♦ ♦ * 



CAAGpTTTTA TACATAACTC mTTGGCAT nTTGATCAT CATTTTTTTC CGGTAGACAA 

5710 5720 5730 5740 5750 5760 

* * * * ♦ * 

TCTCTTGATG TGCAAATTCT AAATATCTCT GCAG^TACT GAAAGAACAG GTCTACAGCA 

5770 5780 5790 5800 5810 5820 

****** 

ACAAGAATCG AGTGTGATAC ATCAAGGGAC AGTTTACGAG TCGGGTGTTA CrTCTTCTCA exOfl 7 

5830 5840 5850 5860 5870 5880 

**«**« 

CXrAGTCGGGG CAGTATAACC GGAATTATAT TGCGGTTAAC CTTCTTGAAC CGAATCAGAA 



5890 5900 5910 5920 5930 5940 

Stop 

TTCCTCCAAC CAAGACCAAC CACCTCTGCA ACTTGTl jTCA 



* * * 5;rnn * * 

TTCAGTCTAA CATAAGCTTC 



5950 5960 5970 5980 5990 6000 

* ♦ * * - ♦ * 

TTTCCTCAGC CTGAGATCGA TCTATAGTGT CACCTAAATG CGGCCGCGTC CCTCAACATC 

6010 6020 6030 6040 6050 6060 

****** 

TAGTCGCAAG CTGAGGGGAA CCACTAGTGT CATACGAACC TCCAAGAGAC GGTTACACAA 

6070 6080 6090 6100 6110 6120 

****** 

ACGGGTACAT TGTTGATGTC ATGTATGACA ATCGCCCAAG TAAGTATCCA 

6130 
* 

GAACGTACGT CCGAATTC 
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